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<220> 
<221> 
<222> 
<223> 



1 

1160 
DNA 

Rhodosporidium toruloides 



CDS 

(1) . . (1104) 
coding for DAAO 



<400> 1 

atg cac teg cag aag cgc gtc gtt gtc etc gga tea ggc gtt ate ggt 
Met His Ser Gin Lys Arg Val Val Val Leu Gly Ser Gly Val lie Gly 
15 10 15 

ctg age age gee etc ate etc get egg aag. ggc tac age gtg cat att 
Leu Ser Ser Ala Leu lie Leu Ala Arg Lys Gly Tyr Ser Val His lie 

20 25 30 



48 



96 



35 etc gcg cgc gac ttg ccg gag gac gtc teg age cag act ttc get tea 
Leu Ala Arg Asp Leu Pro Glu Asp Val Ser Ser Gin Thr Phe Ala Ser 
35 40 45 



144 



cca tgg get ggc gcg aat tgg acg cct ttc atg acg ctt aca gac ggt 
40 Pro Trp Ala Gly Ala Asn Trp Thr Pro Phe Met Thr Leu Thr Asp Gly 

50 55 60 



192 



cct cga caa gca aaa tgg gaa gaa teg act ttc aag aag tgg gtc gag 24 0 

Pro Arg Gin Ala Lys Trp Glu Glu Ser Thr Phe Lys Lys Trp Val Glu 
45 65 70 75 80 
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ttg gtc ccg acg ggc cat gcc atg tgg etc aag ggg acg agg egg ttc 288 
Leu Val Pro Thr Gly His Ala Met Trp Leu Lys Gly Thr Arg Arg Phe 

85 90 " 95 

5 

gcg cag aac gaa gac ggc ttg etc ggg cac tgg tac aag gac ate acg 33 6 

Ala Gin Asn Glu Asp Gly Leu Leu Gly His Trp Tyr Lys Asp lie Thr 

100 105 110 

10 cca aat tac cgc ccc etc cca tct tec gaa tgt cca cct ggc get ate 38 4 

Pro Asn Tyr Arg Pro Leu Pro Ser Ser Glu Cys Pro Pro Gly Ala lie 
115 120 125 

ggc gta ace tac gac ace etc tec gtc cac gca cca aag tac tgc cag 432 
15 Gly Val Thr Tyr Asp Thr Leu Ser Val His Ala Pro Lys Tyr Cys Gin 

130 135 140 

tac ctt gca aga gag ctg cag aag etc ggc gcg acg ttt gag aga egg ' 4 80 

Tyr Leu Ala Arg' Glu Leu Gin Lys Leu Gly Ala Thr Phe Glu Arg Arg 
20 145 150 155 160 

acc gtt acg teg ctt gag cag gcg ttc gac ggt gcg gat ttg gtg gtc 52 8 

Thr Val Thr Ser Leu Glu Gin Ala Phe Asp ' Gly Ala Asp Leu Val Val 

165 170 175 

25 

aac get acg gga ctt ggc gcc aag teg att gcg ggc ate gac gac caa 57 6 

Asn Ala Thr Gly Leu Gly Ala Lys Ser lie Ala Gly lie Asp Asp Gin 

180 185 190 

30 gcc gcc gag cca ate cgc ggg .caa acc gtc etc gtc aag tec cca tgc 624 
Ala Ala Glu Pro lie Arg Gly Gin Thr Val Leu Val Lys Ser Pro Cys 
195 200 " 205 

aag cga tgc acg atg gac teg tec gac ccc get tct ccc gcc tac ate 672 
35 Lys Arg Cys Thr Met Asp Ser Ser Asp Pro Ala Ser Pro Ala Tyr lie 

210 215 220 



att ccc cga cca ggt ggc gaa gtc 
lie Pro Arg Pro Gly Gly Glu Val 
40 225 230 

gga gac tgg gac ttg tct gtc aac 
Gly Asp Trp Asp Leu Ser Val Asn 

245 



ate tgc ggc ggg acg tac ggc gtg 720 
lie Cys Gly Gly Thr Tyr Gly Val 
235 240 

cca gag acg gtc cag egg ate etc 7 68 

Pro Glu Thr Val Gin Arg lie Leu 
250 255 



45 
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aag cac tgc ttg cgc etc 
Lys His Cys Leu Arg Leu 

260 

5 gaa ggc ate gag gtc etc 
Glu Gly lie Glu Val Leu 
275 

cga ggc gga ccc cgc gtt 
10 Arg Gly Gly Pro Arg Val 

290 



gac ccg acc ate teg age 
Asp Pro Thr lie Ser Ser 
265 

cgc cac aac gtc ggc ttg 
Arg His Asn Val Gly Leu 
280 

gag gca gaa egg ate gtc 
Glu Ala Glu Arg lie Val 
295 300 



gac gga acg ate 816 
Asp Gly Thr lie 
270 

cga cct gca cga 8 64 

Arg Pro Ala Arg 

285 

ctg cct etc gac 912 
Leu Pro Leu Asp 



egg aca aag teg ccc etc teg etc ggc agg ggc age gca cga gcg gcg 960 
Arg Thr Lys Ser Pro Leu Ser Leu Gly Arg Gly Ser Ala Arg Ala Ala ■ 
15 305 310 315 320 

aag gag aag gag gtc acg ctt gtg cat gcg tat ggc ttc teg agt gcg 1008 
Lys Glu Lys Glu Val Thr Leu Val His Ala Tyr Gly Phe Ser Ser Ala 

325 330 335 

20 

gga tac cag cag agt tgg ggc gcg gcg -gag gat gtc gcg cag etc gtc 105 6 

Gly Tyr Gin Gin Ser Trp Gly Ala Ala Glu Asp Val Ala Gin Leu Val 

340 *345 350 

» 

25 gac gag gcg- ttc cag egg tac cac ggc gcg gcg egg gag teg aag ttg 1104 
Asp Glu Ala Phe Gin Arg Tyr His Gly Ala Ala Arg Glu Ser Lys Leu 
355 360 365 



30 



40 



tagggeggga tttgtggctg tattgeggge atctacaaga aaaaaaaaaa aaaaaa 1160 



<210> 2 

<211> 368 

<212> PRT 

35 <213> Rhodosporidium toruloides 

<400> 2 

Met His Ser Gin Lys Arg Val Val Val Leu Gly Ser Gly Val lie Gly 
15 10 15 



Leu Ser Ser Ala Leu lie Leu Ala Arg Lys Gly Tyr Ser Val His lie 

20 25 30 



Leu Ala Arg Asp Leu Pro Glu Asp Val Ser Ser Gin Thr Phe Ala Ser 
45 35 40 45 
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Pro Trp Ala 
50 

5 Pro Arg Gin 
65 

Leu Val Pro 

10 

Ala Gin Asn 



Pro Asn Tyr 

15 115 

Gly Val Thr 
130 

20 Tyr Leu Ala 
145 

Thr Val Thr 

25 

Asn Ala Thr 



Ala Ala Glu 

30 195 

Lys Arg Cys 
210 

35 He Pro Arg 
225 

Gly Asp Trp 

40 

Lys His Cys 



Gly Ala Asn Trp 

55 

Ala Lys Trp Glu 
70 

Thr Gly His Ala 
85 

Glu Asp Gly Leu 
100 

Arg Pro Leu Pro 



Tyr Asp Thr Leu 

135 

Arg Glu Leu Gin 
150 

Ser- Leu Glu Gin 
165 

Gly Leu Gly Ala 
180 

Pro He Arg Gly 



Thr Met Asp Ser 

215 

Pro Gly Gly Glu 
230 

Asp Leu Ser Val 
245 

Leu Arg Leu Asp 
260 



Thr Pro Phe Met Thr 

60 

Glu Ser Thr Phe Lys 

75 

Met Trp Leu Lys Gly 
90 

Leu Gly His Trp Tyr 
105 

Ser Ser Glu Cys Pro 
120 

Ser Val His Ala Pro 

140 

Lys Leu Gly Ala Thr 

155 

Ala Phe Asp Gly Ala 
170 

Lys Ser He Ala Gly 
185 

Gin Thr Val Leu Val 
200 

Ser Asp Pro Ala Ser 

220 

Val He Cys Gly Gly 

235 

Asn Pro Glu Thr Val 
250 

Pro Thr He Ser Ser 
265 



Leu Thr Asp Gly 



Lys Trp Val Glu 

80 

Thr Arg Arg Phe 
95 

Lys Asp He Thr 
110 

Pro Gly Ala He 
125 

Lys Tyr Cys Gin 



Phe Glu Arg Arg 

160 

Asp Leu Val Val 
175 

lie Asp Asp Gin 
190 

Lys Ser Pro Cys 
205 

Pro Ala Tyr He 



Thr Tyr Gly Val 

240 

Gin Arg He Leu 
255 

Asp Gly Thr He 
270 



Glu Gly He Glu Val Leu Arg His Asn Val Gly Leu Arg Pro Ala Arg 
45 275 280 285 
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Arg Gly Gly Pro Arg Val Glu Ala Glu Arg He Val Leu Pro Leu Asp 
290 295 300 

5 Arg Thr Lys Ser Pro Leu Ser Leu Gly Arg Gly Ser Ala Arg Ala Ala 
305 310 315 320 

Lys .Glu Lys Glu Val Thr Leu Val His Ala Tyr Gly Phe Ser Ser Ala 

325 330 335 

10 

Gly Tyr Gin Gin Ser Trp Gly Ala Ala Glu Asp Val Ala Gin Leu Val 

340 345 350 

Asp Glu Ala Phe Gin. Arg Tyr His Gly Ala Ala Arg Glu Ser Lys Leu 
.15 355 360 365 

<210> 3 

<211> 1005 

<212> DNA 

20 <213> Caenorhabditis elegans 

<220> 

<221> CDS 

<222> (1) . . (1002) 

25 <223> coding for DAAO 

* 

<400> 3 

atg gca aac ata att ccg aag att gca att ate ggc gaa gga gtc att 4 8 

Met Ala Asn He lie Pro Lys He Ala lie He Gly Glu Gly Val He 
30 1 5 10 15 

gga tgt act tea gca ctt caa .ata tea aaa get ata cca aat gcg aaa 96 
Gly Cys Thr Ser Ala Leu Gin He Ser Lys Ala He Pro Asn Ala Lys 

20 25 30 

35 

ata act gtg etc cac gat aaa cca ttt aaa aaa teg tgc agt gca gga 144 
He Thr Val Leu His Asp Lys Pro Phe Lys Lys Ser Cys Ser Ala* Gly 
35 40 45 

40 cca gca gga tta ttt aga ate gat tat gag gag aat act gaa tac gga 192 
Pro Ala Gly Leu Phe Arg He Asp Tyr Glu Glu Asn Thr Glu Tyr Gly 
50 55 60 

cgt get tct ttc gee tgg ttc tea cat' etc tat cgc act aca 'aaa gga 24 0 

45 Arg Ala Ser Phe Ala Trp Phe Ser His Leu Tyr Arg Thr Thr Lys Gly 
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65 70 75 80 

tec gaa acc ggc gtg aaa tta gtt tct gga cat att caa tec gac aac 288 

Ser .Glu Thr Gly Val Lys Leu Val Ser Gly His lie Gin Ser Asp Asn 

85 90 95 



ttg gag tea 
Leu Glu Ser 



aac ttt aga 
Asn Phe Arg 
115 

15 cca tea aag 
Pro Ser Lys 
130 



ttg aag caa 
Leu Lys Gin 
100 

ttc ttg gat 
Phe Leu Asp 

cac tgc att 
His Cys lie 



caa caa aga 
Gin Gin Arg 
105 

gat aga gaa 
Asp Arg Glu 
120 

cac tac acc 
His Tyr Thr 
135 



gee tat ggc 
Ala Tyr Gly 

egg ctg gac 
Arg Leu Asp 

gec tac gca 
Ala Tyr Ala 
' 140 



gat att gtg 
Asp lie Val 
110 

att ttt ccc 
lie Phe Pro 
125 

tea gaa ggt 
Ser Glu Gly 



tac 336 
Tyr 

gaa 384 
Glu 



aac 432 
Asn 



aag tac gtg cct tat ttg aag aat 
20 Lys Tyr Val Pro Tyr Leu Lys Asn 
145 150 

ttc aag caa caa gaa gtg acg agt 
Phe Lys Gin Gin Glu Val Thr Ser 
25 165 



ttg ctg ctt gag caa aaa ate gag 4 80 

Leu Leu Leu Glu Gin Lys lie Glu 
155 160 

ttg gac gca gtc gee gac get ggt 528 
Leu Asp Ala Val Ala Asp Ala Gly 
17 0 175 



" tac gat gtt att gta aac tgc gca ggc ttg tac ggt gga aag ttg get 576 

Tyr Asp Val lie Val Asn Cys Ala Gly Leu Tyr Gly Gly Lys Leu Ala 

180 185 190 

30 

ggt gat gac gat act tgc tac ccc att aga gga gtc att ttg gaa gtt 624 

Gly Asp Asp Asp Thr Cys Tyr Pro lie Arg Gly Val lie Leu. Glu Val 
195 • 200 205 

35 gat gca cca tgg cac aag cac ttc aat tat cga gac ttt act act ttc 672 
Asp Ala Pro Trp His Lys ' His Phe Asn Tyr Arg Asp Phe Thr Thr Phe 
210 215 • 220 

4 

aca att cca aaa gag cac age gtg gtg gtt ggg tec acc aag cag gac 720 

40 Thr lie Pro Lys Glu His Ser Val Val Val Gly Ser Thr Lys Gin Asp 
225 230 235 240 

» 

aat cga tgg gat ttg gag ate acc gac gag gat aga aat gat att ttg 7 68 

Asn Arg Trp Asp Leu Glu lie Thr Asp' Glu Asp Arg Asn Asp 'lie Leu 
45 245 250 255 
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aaa cga tac att get tta cat cct gga atg aga gag cca aag att ate 816 
Lys Arg Tyr lie Ala Leu His Pro Gly Met Arg Glu Pro Lys lie lie 

260 265 270 

5 

aaa gaa tgg tea gca ctt cgc ccg gga cgt aag cat gtc aga att gaa 8 64 

Lys Glu Trp Ser Ala Leu Arg Pro Gly Arg Lys His Val Arg lie Glu 
275 280 285 

10 gcg cag aag agg aca tct gtt gga aac tea aaa gat tat atg gtt gtg 912 
Ala Gin Lys Arg Thr Ser Val Gly Asn Ser Lys Asp Tyr Met Val Val 
290 295 300 

cat cac tat ggt cac ggg age aac gga ttc acg ttg ggt tgg gga aca ' 960 

15 His His Tyr Gly His Gly Ser Asn Gly Phe Thr Leu Gly Trp Gly Thr 
305 310 315 320 

gca att gaa gca act aaa ctt gtt aag act gca eta gga tta taa 1005 
Ala lie Glu Ala Thr Lys Leu Val Lys Thr Ala Leu Gly Leu 
20 325 330 

<210> 4 

<211> 334 

<212> PRT 

25 <213> Caenorhabditis elegans 

<400> 4 

Met Ala Asn lie lie Pro Lys He Ala He He Gly Glu Gly Val He - 
1 5 .10 15 



30 



Gly Cys Thr Ser Ala Leu Gin lie Ser Lys Ala "He Pro Asn Ala Lys 

20 25* 30 



He Thr Val Leu His Asp Lys Pro Phe Lys Lys " Ser Cys Ser Ala Gly 
35 35 40 45 

Pro Ala Gly Leu Phe Arg He Asp Tyr Glu Glu Asn Thr Glu Tyr Gly 
50 55 60 

40 Arg Ala Ser Phe Ala Trp Phe Ser His Leu Tyr Arg Thr Thr Lys Gly 
65 70 75 80 

Ser Glu. Thr Gly Val Lys Leu Val Ser Gly His He Gin S.er Asp Asn 

85 90' 95' 

45 
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Leu Glu Ser Leu Lys Gin Gin Gin Arg Ala Tyr Gly Asp lie Val Tyr 

100 105 110 

Asn Phe Arg Phe Leu Asp Asp Arg Glu Arg Leu Asp He Phe Pro Glu 
5 115 120 125 

Pro Ser Lys His Cys He His Tyr Thr Ala Tyr Ala Ser Glu Gly Asn 
130 135 140 

10 Lys Tyr Val Pro Tyr Leu Lys Asn Leu Leu Leu Glu Gin Lys He Glu 
145' 150 155 160 

Phe Lys Gin Gin Glu Val Thr Ser Leu Asp Ala Val Ala Asp Ala Gly 

165 170 175 

15 

Tyr Asp Val He Val Asn Cys Ala Gly Leu Tyr Gly Gly Lys Leu Ala 

180 185 190 

Gly Asp Asp Asp Thr Cys Tyr Pro He Arg Gly Val He Leu Glu Val 
20 195 200 205 

Asp Ala Pro Trp His Lys His Phe Asn Tyr Arg Asp Phe Thr Thr Phe 
210 ' 215 220 

25 Thr He Pro Lys Glu His Ser Val Val Val Gly Ser Thr Lys Gin Asp 
225 230 235 240 

Asn Arg Trp Asp Leu Glu He . Thr Asp Glu Asp Arg Asn Asp He Leu 

245 250 255 

30 

Lys Arg Tyr He Ala Leu His Pro Gly Met Arg Glu Pro Lys He He 

260 . 265 ■ 270 

Lys Glu Trp Ser Ala Leu Arg Pro Gly Arg Lys His Val Arg He Glu . 
35 27 5 280 285 

Ala Gin Lys Arg Thr Ser Val Gly Asn Ser Lys Asp Tyr Met Val Val 
290 295 300 

40 His His Tyr Gly His Gly Ser Asn Gly Phe Thr .Leu Gly Trp Gly Thr 
305 310 315 320 



45 



Ala He Glu Ala Thr Lys, Leu Val Lys Thr Ala Leu Gly Leu 

325 330' 
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<210> 5 

<211> 118 6 

<212> DNA. 

<213> Nectria haematococca 



10 



15 



<220> 
<221> 
<222> 
<223> 



CDS 

(42) . . (1124) 
coding for DAAO 



<400> 5 

agcgacttga. atttagcgaa aagaacttgt" caaccacaat c atg tec aac aca ate 

Met Ser Asn Thr lie 
1 5 

gtc gtc gtt ggt gec ggt gtc att ggc ttg acg teg gec ttg ttg etc 
Val Val Val Gly Ala Gly Val lie Gly Leu Thr Ser Ala Leu Leu Leu 

10 15 20 



56 



104 



20 tec aag aac aag ggc aac aag ate acc gtc gtg gec aag cac atg ccc 
Ser Lys Asn Lys Gly Asn Lys lie Thr Val Val Ala Lys His Met Pro 

25 • 30 35 



152 



ggc gac tat gac gtt gaa tac gec teg cct ttt get ggt gee aac cac 
25 Gly Asp Tyr Asp Val Glu Tyr Ala Ser Pro Phe Ala Gly Ala Asn His 

4 0 45 50 



200 



tec ccc atg gcg acg gaa gag age age gaa tgg gaa cgt cgc act tgg 24 8 

Ser Pro Met Ala Thr Glu Glu Ser Ser Glu Trp Glu Arg Arg Thr Trp 
30 55 60 65 

tac gag ttt aag aga ctg gtc gag gag gtc cct gag gee ggt gtt cat 296 

Tyr Glu Pile Lys Arg Leu Val Glu Glu Val Pro Glu Ala Gly Val His 
70 75 80 85 

35 

ttc cag aag tct cgc ate cag agg cgc aat gtg gac act gaa aag gcg 34 4 

Phe Gin Lys Ser Arg lie Gin Arg Arg Asn Val Asp Thr Glu Lys Ala 

90 ' 95 100 

40 cag agg tct ggt ttc cca gac gee etc ttc teg aaa gaa ccc tgg ttc 392 
Gin Arg Ser Gly Phe Pro Asp Ala Leu Phe Ser Lys Glu Pro Trp Phe 

105 110 115 

aag aac atg ttt gag gac ttc cgt gag cag cac cct age gag gtc ate 44 0 

45 Lys Asn Met Phe Glu Asp Phe Arg Glu Gin His Pro Ser Glu Val lie 



WO 2005/090581 PCT/EP2005/002734 

10/40 

120 125 130 



536 



584 



632 



680 



ccc ggt tac gac tct ggc tgc gag ttc aca teg gtg tgc ate aac acg 488 
Pro Gly Tyr Asp Ser Gly Cys Glu Phe Thr Ser Val Cys He Asn Thr 
135 140 145 

gec ate tac etc ccc tgg etc etc ggc cag tgc ate aag aat ggc gtc 
Ala lie Tyr Leu Pro Trp Leu Leu Gly Gin Cys He Lys Asn Gly Val 
150 155 160 165 

ate gtc aag cgc gee ate etc aac gac att age gag gee aag aag ctg 
He Val Lys Arg Ala He Leu Asn Asp He Ser Glu Ala Lys Lys Leu 

170 175 180 

age cac gcg ggc aag acg ccc aat ate ate gtc aac gee acg ggt etc 
Ser His Ala Gly Lys Thr Pro Asn He He Val Asn Ala Thr Gly Leu 

185 190 - 195 

ggc tec tac aag ctg ggc ggt gtc gag gac aag ace atg gcg cct gcg 
Gly Ser Tyr Lys Leu Gly Gly Val Glu Asp Lys Thr Met Ala Pro Ala" 
200 205 210 

egg gga cag att gtg gtt gtg cgc aac gag age age . ccc atg etc etc 
Arg Gly Gin He Val Val Val Arg Asn Glu Ser Ser Pro Met Leu Leu - 
215 220~ ' 225 

act tea ggt gtc gag gac ggc ggt get gat gtc atg tac ttg atg cag 77 6 

Thr Ser Gly Val Glu Asp Gly Gly Ala Asp Val Met Tyr Leu Met Gin 
230 235 240 245 

cga gca get ggc ggt ggc ace ate ctg ggc ggt ace tac gac gtt ggc 824 
Arg Ala Ala Gly Gly- Gly Thr He Leu Gly Gly Thr Tyr Asp Val Gly 

250 • 255 260 

* 

aac tgg gag tct cag cca gac ccc aac ate gcg aat cgc ate atg cag 
Asn Trp Glu Ser Gin Pro Asp Pro Asn He Ala Asn Arg He Met Gin 

265 270 275 

cgc ate gtc gag gtg egg ccc gag att gee aac ggc. aag ggc gtc aag 920 
Arg He Val Glu Val Arg Pro Glu He Ala Asn Gly Lys Gly Val Lys 
280 285 290 



728 



872 



ggg ctg age gtg ate cga cac gee gtc ggc atg egg ccg tgg cga aag 
Gly Leu Ser Val He Arg Mis Ala Val Gly Met Arg Pro Trp Arg Lys 
295 300 305 



968 
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gac gga gtc agg ate gag gag gag aag ctg gat gat gag act tgg ate 1016 
Asp Gly Val Arg lie Glu Glu Glu Lys Leu Asp Asp Glu Thr Trp lie 
310 315 320 325 

5 

gtg cac aac tac gga cac tct gga tgg ggt tac cag ggt teg tat ggt 1064 
Val His Asn Tyr Gly His Ser Gly Trp Gly Tyr Gin Gly Ser Tyr Gly 

330 335 340 

10 tgt get gag aat gta gtc cag ttg gtt gac aag gtc ggc aag gcg gec 1112 
Cys Ala Glu Asn Val Val Gin Leu Val Asp Lys Val Gly Lys Ala Ala 

345 350 355 

aag tct aag ctg tagttgaaaa ggcctgaatg agtaatagta attggatatt 1164 
15 Lys Ser Lys Leu 

360 

ggaaataccg tatttgeect eg 118 6 

20 <210> 6 

<211> 361 

<212> PRT 

<213> Nectria haematococca 

25 <400> 6 

Met Ser Asn Thr lie Val Val Val .Gly Ala Gly Val lie Gly Leu Thr 
1 5 10 15 

Ser Ala Leu Leu Leu Ser Lys 'Asn Lys Gly Asn Lys lie Thr Val Val 
30 20 25 30 

Ala Lys His Met Pro Gly Asp Tyr Asp Val .Glu Tyr Ala Ser Pro Phe 
35 40 45 

35 Ala Gly Ala Asn His Ser Pro Met Ala Thr Glu Glu Ser Ser Glu Trp 

50 55 60 



40 



Glu Arg Arg Thr Trp Tyr Glu Phe Lys Arg Leu Val Glu Glu Val Pro 

65 70 75 80 

Glu Ala Gly Val His Phe Gin Lys Ser Arg lie Gin Arg Arg Asn Val 

85 90 95 



Asp Thr Glu Lys Ala Gin Arg' Ser Gly Phe Pro Asp 'Ala Leu Phe Ser 
45 100 105 110 
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Lys Glu Pro 
115 

5 Pro Ser Glu 
130 

Val Cys lie 
145 

10 

He Lys Asn 
Glu Ala Lys 

15 

Asn Ala Thr 

* 

195 

20 Thr Met Ala 

210 

Ser Pro Met 

• 225 

25 

Met Tyr Leu 
Thr Tyr Asp 

30 

Asn Arg He 
275 

35 

Gly Lys Gly 
290 

Arg Pro Trp 
40 305 

Asp Glu Thr 



Trp Phe Lys Asn 



Val He Pro Gly 

135 

Asn Thr Ala lie 
150 

Gly Val lie Val 
165 

Lys Leu Ser His 
180 

Gly Leu Gly Ser 



Pro Ala Arg Gly 

215 

Leu Leu Thr Ser 
230 

Met Gin Arg Ala 
245 

Val Gly Asn Trp 
2,60 

Met Gin Arg He 



Val Lys Gly Leu 

295 

Arg Lys Asp Gly 
310 

Trp He Val His 
325 



Met Phe Glu Asp Phe 
120 

Tyr Asp Ser Gly Cys 

140 

Tyr Leu Pro Trp Leu 

155 

Lys Arg Ala He Leu 
170 

Ala Gly Lys Thr Pro 
185 

Tyr Lys Leu Gly Gly 
200 

Gin He Val Val Val 

220 

Gly Val Glu Asp Gly 

235 

Ala Gly Gly Gly Thr 
250 

Glu Ser Gin Pro Asp 
265 

Val Glu Val Arg -Pro 
280 



Ser Val He Arg His 

300 

Val Arg He Glu Glu 

315 

Asn Tyr Gly His Ser 
330 



Arg Glu Gin His 
125 

Glu Phe Thr Ser 



Leu Gly Gin Cys 

160 

Asn Asp He Ser 
175 

Asn He He Val 
190 

Val Glu Asp Lys 
205 

Arg Asn Glu Ser 



Gly Ala Asp Val 

240 

lie Leu Gly Gly 
255 

Pro Asn He Ala 
270 

Glu lie Ala Asn 
285 



Ala Val Gly Met 



Glu Lys Leu Asp 

320 

Gly Trp Gly Tyr 
335 



45 
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Gin Gly Ser Tyr Gly Cys Ala Glu Asn Val Val Gin Leu Val Asp Lys 

340 345 350 

Val Gly Lys Ala Ala Lys Ser Lys Leu 
5 355* 360 

<210> 7 

<211> 1071 

<212> DNA 

10 <213> Trigonopsis variabilis 

<220> 
<221> CDS 
<222>* (1) . . (1068) 
15 <223> 

<400> 7 ■ 

atg get aaa ate gtt gtt att ggt gee ggt gtt gec ggt tta act aca 48 

Met Ala Lys lie Val Val He Gly Ala Gly Val Ala Gly Leu Thr Thr 
20 1 5 .10 15 

get ctt caa ctt ctt cgt aaa gga cat gag gtt aca att gtg tec gag 96 

Ala Leu Gin Leu Leu Arg Lys Gly His Glu Val Thr" He Val Ser Glu 

20 25 . 30 

25 

" . ttt acg ccc ggt gat ctt agt ate gga tat ace teg cct tgg gca ggt 144 

Phe Thr Pro Gly Asp Leu Ser lie Gly Tyr Thr Ser Pro Trp Ala Gly 
35 40 45 

30 gec aac tgg etc aca ttt tac gat gga ggc aag tta gee gac tac gat 192 
Ala Asn Trp Leu Thr Phe Tyr Asp Gly Gly Lys Leu Ala Asp Tyr Asp 
50 55 60 

gec gtc tct tat cct ate ttg cga gag ctg get cga age age ccc gag 240 
35 Ala Val Ser Tyr Pro lie Leu Arg Glu Leu 'Ala Arg Ser Ser Pro Glu 
65 '70 75 80 

get gga att cga etc ate age caa cgc tec cat gtt etc aag cgt gat 28 8 

Ala Gly He Arg Leu He Ser Gin Arg Ser His Val Leu Lys Arg Asp 
40 85 90 95 

ctt cct aaa ctg gaa gtt gee atg teg gec ate tgt caa cgc aat ccc 336 

Leu Pro Lys Leu Glu Val Ala Met Ser Ala He Cys Gin Arg Asn Pro 

'100 105 110 

45 
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tgg ttc aaa aac aca gtc gat tct ttc gag att ate gag gac agg tec 38 4 

Trp Phe Lys Asn Thr Val Asp Ser Phe Glu lie lie Glu Asp Arg Ser 
115 120 125 

5 agg att gtc cac gat gat gtg get tat eta gtc gaa ttt cgt tec gtt 432 

Arg lie Val His Asp Asp Val Ala Tyr Leu Val Glu Phe Arg Ser Val 

130 135 140 

tgt ate cac acc gga gtc tac ttg aac tgg ctg atg tec caa tgc tta 48 0 

10 Cys lie His Thr Gly Val Tyr Leu Asn Trp Leu Met Ser Gin Cys Leu 
145 150 155 160 

teg etc ggc gec acg gtg- gtt aaa cgt cga gtg aac cat ate aag gat 528 

Ser 'Leu Gly Ala Thr Val Val Lys Arg Arg Val Asn His lie Lys Asp 
15 165 170 . 175 

gee aat tta eta cac tec tea gga tea cgc ccc gac gtg att gtc aac 57 6 

Ala Asn Leu Leu His Ser Ser Gly Ser Arg Pro Asp Val He Val Asn 

180 185 ' 190 

20 

tgt agt ggt etc ttt gec egg ttc ttg gga ggc gtc gag gac aag aag 624 

Cys Ser Gly Leu Phe Ala Arg Phe Leu Gly Gly Val Glu Asp Lys Lys 
195 " . 200 . 205 

25 atg tac cct att cga gga caa gtc gtc ctt gtt cga aac tct ctt cct 672 

Met Tyr Pro He Arg Gly Gin Val Val Leu Val Arg Asn Ser Leu Pro 

210 215 . • 220 

ttt atg gec tec ttt tec age act cct gaa aaa gaa aat gaa gac gaa 720 

30 Phe Met Ala Ser Phe Ser Ser Thr Pro Glu Lys Glu 'Asn Glu Asp Glu 
225 230 . 235 240 

* • 

get eta tat ate atg acc cga ttc gat ggt act tct ate att ggc ggt 7 68 

Ala Leu Tyr He Met Thr Arg Phe Asp Gly. Thr Ser He lie Gly Gly • 
35 245 250 255 

tgt ttc caa ccc aac aac tgg tea tec gaa ccc gat cct tct etc acc 816 

Cys Phe Gin Pro Asn Asn Trp Ser Ser Glu Pro Asp Pro Ser Leu Thr 

260 ' 265 270 



40 



cat cga ate ctg tct aga gee etc gac cga ttc ccg gaa ctg acc aaa 8 64 

His Arg He Leu Ser Arg Ala Leu Asp Arg Phe Pro Glu Leu Thr Lys 
275 280 285 



45 
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gat ggc cct ctt gac att gtg cgc gaa tgc gtt ggc cac cgt cct ggt 912 
Asp Gly Pro Leu Asp lie Val Arg Glu Cys Val Gly His Arg Pro Gly 
290 295 300 

5 aga gag ggc ggt ccc cga gta gaa tta gag aag ate ccc ggc gtt ggc 960 
Arg Glu Gly Gly Pro Arg Val Glu Leu Glu Lys lie Pro Gly Val Gly 
305 310 315 320 

ttt gtt gtc cat aac tat ggt gec gec ggt get ggt tac caa tec tct 1008 
10 Phe Val Val His Asn Tyr Gly Ala Ala Gly Ala Gly Tyr Gin Ser Ser 

325 330 335 

tac ggc atg get gat gaa get gtt tct tac gtc gaa aga get ctt act 105 6 

Tyr Gly Met Ala Asp Glu Ala Val Ser Tyr Val Glu Arg Ala Leu Thr 
15 340 345 350 

cgt cca aac ctt tag 1071 
Arg Pro Asn Leu 
355 

20 

<210> 8 
<211> 356 
<212> PRT 
25 <213> Trigonopsis variabilis 

<400> 8 

Met Ala Lys lie Val Val lie Gly Ala Gly Val Ala Gly Leu Thr Thr 
1 5 10 15 



30 



Ala Leu Gin Leu Leu Arg Lys Gly His Glu Val Thr lie Val Ser Glu 

20 25 30 



Phe Thr Pro Gly Asp Leu Ser lie Gly .Tyr Thr Ser Pro Trp Ala Gly 
35 35 40 45 

Ala Asn Trp Leu Thr Phe Tyr Asp Gly Gly Lys Leu Ala Asp Tyr Asp 
50 55 60 

40 Ala Val Ser Tyr Pro lie Leu Arg Glu Leu Ala Arg Ser Ser Pro Glu 
65 70 75 80 

Ala Gly lie Arg Leu lie Sex Gin Arg Ser His Val Leu Lys Arg Asp 

'85 - '90 95 

45 
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Leu Pro Lys Leu Glu Val Ala Met Ser Ala lie Cys Gin Arg Asn Pro 

100 105 110 

Trp Phe Lys Asn Thr Val Asp Ser Phe GILu lie lie Glu Asp Arg Ser 
5 115 120 125 

Arg lie Val His Asp Asp Val Ala Tyr Leu Val Glu Phe Arg Ser Val 
130 135 140 

10 Cys lie His Thr Gly Val Tyr Leu Asn Trp Leu Met Ser Gin Cys Leu 
145 150 155 " 160 

♦ 

Ser Leu Gly Ala Thr Val Val Lys Arg T^xg Val Asn His lie Lys Asp 

165 170 175 

15 

Ala Asn Leu Leu His Ser Ser Gly Ser JVrg Pro Asp Val lie Val Asn 

180 185 190 

Cys Ser Gly Leu Phe Ala Arg Phe Leu Gly Gly Val Glu Asp Lys Lys 
20 195 200 205 

Met Tyr Pro lie Arg Gly Gin Val ■ Val Leu Val Arg Asn Ser Leu Pro 
210 215 220 

25 Phe Met Ala Ser Phe Ser Ser Thr Pro Glu Lys Glu Asn Glu Asp Glu 
225 230 235 240 

Ala Leu Tyr lie Met Thr Arg Phe Asp Gly Thr Ser lie lie Gly Gly 

245 • 250 255 

30 

Cys Phe Gin Pro Asn Asn Trp Ser Ser Glu Pro Asp Pro . Ser Leu Thr 

260 ' 265 270 

His Arg lie Leu Ser Arg Ala Leu Asp Arg Phe Pro. Glu Leu Thr Lys . 
35 275 280 285 

Asp Gly Pro Leu Asp lie Val Arg Glu Cys Val Gly His Arg Pro Gly 
290 295 300 

40 Arg Glu Gly Gly Pro Arg Val Glu Leu Glu Lys lie Pro Gly Val Gly • 
305 310 315 320 



Phe Val Val His Asn Tyr Gly Ala Ala Gly Ala Gly Tyr Gin Ser Ser 

32'5 330 335' 

45 
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Tyr Gly Met Ala Asp Glu Ala Val Ser Tyr Val Glu Arg Ala Leu Thr 

340 345 350 

Arg Pro Asn Leu 
5 355 



10 



<210> 9 

<211> 1047 

<212> DNA 

<213> Schizosaccharomyces pombe 



15 



<220> 
<221> 
<222> 
<223> 



CDS 

(22) . . (1041) 
coding for DAAO 



20 



<400> 9 

atgactaagg aaaataagcc a aga gat att gtc ate gtt ggc get ggc gtt 

Arg Asp lie Val lie Val Gly Ala Gly Val 
- 1 5 10 



51 



25 



att gga ttg acc act get tgg att ctt tea gac ttg ggt ctt get cct 
lie Gly Leu Thr Thr Ala Trp lie Leu. Ser Asp Leu Gly Leu Ala Pro 

15 20 25 



99 



30 



cgt att aag gtg att gee aag tat acg cct cjaa gat cgt tct gta gaa 

Arg lie Lys Val lie Ala Lys Tyr Thr Pro. Glu Asp Arg Ser Val Glu 

30 35 40 

tac act tec cct tgg get ggc gca aat ttc tgt age - att tct get act 

Tyr Thr Ser Pro Trp Ala Gly Ala Asn Phe Cys Ser lie Ser Ala Thr 
45 50 55 



147 



195 



35 gat' gac aat get ttg cgc tgg gat aaa ate act tac cat cgt ttc gee 
Asp Asp Asn Ala Leu Arg Trp Asp Lys lie Thr Tyr His Arg Phe Ala 
60 65 70 



243 



tac ttg gcg aaa act cgt cct gaa gca gga ate cgt ttt get gat ctt 
40 Tyr Leu Ala Lys Thr Arg Pro Glu Ala Gly lie Arg Phe Ala Asp Leu 
75 80 85 90 



291 



45 



cga gaa ttg tgg gag tac gag ccg aaa cac cjac aaa ate aga tec tgg 
Arg Glu Leu Trp Glu 'Tyr Glu Pro Lys His 'Asp Lys lie Arg Ser Trp 

95 100 105 



339 
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aat acc tat gtc aga gat ttc aaa gtt ate cct gaa aaa gat ctt cca 387 

Asn Thr Tyr Val Arg Asp Phe Lys Val lie Pro Glu Lys Asp Leu Pro 

110 115 120 

5 gga gaa tgt ate tac gga cat aag gec acc acc ttt tta ate aac get 435 
Gly Glu Cys He Tyr Gly His Lys Ala Thr Thr Phe Leu He Asn Ala 
125 130 , 135 

cct cat tac ttg aat tat atg tac aag ctg etc att gaa get ggc gtc 4 83 

10 Pro His Tyr Leu Asn Tyr Met Tyr Lys Leu Leu lie Glu Ala Gly Val 

140 . 145 150 

gaa ttt gaa aag aaa gaa ttg agt cac ate aaa gag act gtc gaa gaa 531 
Glu Phe Glu Lys Lys Glu Leu Ser His He Lys Glu Thr Val Glu Glu 
15 155 160 165 170 

act cca gaa get tea gta gta ttt aat tgc act ggt etc tgg get tec 57 9 

Thr Pro Glu Ala Ser Val Val Phe Asn Cys Thr Gly Leu Trp Ala Ser 

175 180 185 

20 

aaa ttg ggt ggc gtt gaa gac ccg gac gtt tat ccg act cgt gga cat 627 

Lys Leu Gly Gly Val Glu Asp Pro Asp Val Tyr Pro Thr Arg Gly His 

190 195 200 

25 gtt gtt ttg gtt aag get cct cat gta aca gaa act cgc att ttg aat 675 
Val Val Leu Val Lys Ala Pro His Val Thr Glu Thr Arg He Leu Asn 
205 210 215 

ggc aag aac tct gat acc tat att att cct cgt ccc tta aat ggt gga 723 
30 Gly Lys Asn Ser Asp Thr Tyr He He Pro Arg Pro Leu Asn Gly Gly 

220 . 225 230 

gtc att tgc ggc ggt ttc atg caa cca gga aac tgg gat cgt gaa att 771 
Val He Cys Gly Gly Phe Met Gin Pro Gly Asn Trp Asp Arg Glu He ■ 
35 235 240 245 250 

cac cct gaa gac act ttg gat ate ctt aag aga aca teg get ttg atg 819 
His Pro Glu Asp Thr Leu Asp He Leu Lys Arg Thr Ser Ala Leu Met 

255 260 265 



40 



cca gaa ttg ttc cac ggc aag ggt ccg gag ggt get gaa att att caa 8 67 

Pro Glu Leu Phe His Gly Lys Gly Pro Glu Gly Ala Glu -lie He Gin 

270 275 280 



45 
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gaa tgt gtc gga ttc cgt cct tct cga aag ggt ggt gcc cgc gta gag 915 
Glu Cys Val Gly Phe Arg Pro Ser Arg Lys Gly Gly Ala Arg Val Gin 
285 290 295 

5 ctt gat gtt gtt ccc ggc acc tea gtc ccc ctt gtt cat gat tac ggt 963 
Leu Asp Val Val Pro Gly Thr Ser Val Pro Leu Val His Asp Tyr Gly 
300 305 310 

get tct ggc aca gga tac caa get ggt tat ggt atg get ctt gac -fcet 1011 
10 Ala Ser Gly Thr Gly Tyr Gin Ala Gly Tyr Gly Met Ala Leu Asp Ser 
315 320 325 330 

gtc atg ttg get ctt cct aaa ate aaa ttg gcttag , 1047 

Val Met Leu Ala Leu Pro Lys lie Lys Leu 
15 335 340 



<210> 10 

<211> 340 

20 <212> PRT 

<213> Schizosaccharomyces pornbe 

<400> 10 

Arg Asp lie Val lie Val Gly Ala Gly Val lie Gly Leu Thr Thr Ala 
25 1 5 10 15" 

Trp lie Leu Ser Asp Leu Gly Leu Ala Pro Arg He Lys Val He Ala 

.20 25 -30 

30 Lys Tyr Thr Pro Glu Asp Arg Ser Val Glu Tyr Thr Ser- Pro Trp Ala 

35 40 45 

Gly Ala Asn Phe Cys Ser He Ser Ala Thr Asp Asp Asn Ala "Leu Arg 
50 55 60 



35 



Trp Asp Lys He Thr Tyr His Arg Phe Ala Tyr Leu Ala Lys Thr Arg 
65 70 75 80 



Pro Glu Ala Gly He Arg Phe Ala Asp Leu Arg Glu Leu Trp Glu Tyr 
40 85 90 95 

Glu Pro Lys His Asp Lys He Arg Ser Trp Asn Thr Tyr Val Arg Asp 

100 105 110 



45 
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Phe Lys Val lie Pro Glu Lys Asp Leu Pro Gly Glu Cys lie Tyr Gly 
115 120 125 

His Lys Ala Thr Thr Phe Leu lie Asn Ala Pro His Tyr Leu Asn Tyr 
5 130 135 140 

Met Tyr Lys Leu Leu lie Glu Ala Gly Val Glu Phe Glu Lys Lys Glu 
145 150 155 160 

10 Leu Ser His lie Lys Glu Thr Val Glu Glu Thr Pro Glu Ala Ser Val 

165 170 175 

Val Phe Asn Cys Thr Gly Leu Trp Ala Ser Lys Leu Gly Gly Val Glu 

180 185 190 

15 

Asp Pro Asp Val Tyr Pro Thr Arg Gly His Val Val Leu Val Lys Ala 
195 200 205 

Pro His Val Thr Glu Thr Arg lie Leu Asn Gly Lys Asn Ser Asp Thr 
20 210 215 220 

Tyr lie lie Pro Arg Pro Leu Asn Gly Gly Val lie Cys Gly Gly Phe 
225 230 235 240 

25 Met Gin Pro Gly Asn Trp Asp Arg Glu lie His Pro Glu Asp Thr Leu 

245 250 255 

Asp lie Leu Lys Arg Thr Ser Ala Leu Met Pro Glu Leu Phe His Gly 

260 265 270 

30 

Lys Gly Pro Glu- Gly Ala Glu lie lie Gin Glu Cys Val Gly Phe Arg 
275 280 285 



35 Pro Ser Arg Lys Gly Gly Ala Arg Val Glu Leu Asp Val Val Pro Gly 

290 295 300 

Thr Ser Val Pro Leu Val His Asp Tyr Gly Ala Ser Gly Thr Gly Tyr 
305 310 315 320 

40 

Gin Ala Gly Tyr Gly Met Ala Leu Asp Ser Val Met Leu Ala Leu Pro 

325 330 • 335 

Lys lie' Lys Leu 
45 340 
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<210> 11 

<211> 963 

<212> DNA 

<213> Streptomyces coelicolor 



PCT/EP2005/002734 



21/40 



10 



15 



20 



<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 



CDS 

(31) . . (957) 
coding for DAAO 



misc__f eature 
(880) . . (936) 
DAAO signature 



<400> 11 

gtggaaaccg aactggatga cgagcgggat ggc gaa gtc gtc gtg gtc ggc ggc 

Gly Glu Val Val Val Val Gly Gly 
1 5 

ggg gtg ate ggg ctg acg acg gec gtc gtc etc gec gag egg ggc aga 
Gly Val lie Gly Leu Thr Thr Ala Val Val Leu. Ala Glu Arg Gly Arg 
10 15 20 



54 



102 



25 egg gtg egg ctg tgg. ace egg gag ecc gcg gag egg ace ace teg gtg 
Arg Val Arg Leu Trp Thr Arg Glu Pro Ala Glu Arg Thr Thr Ser Val 
25 30 35 40 



15 0 



gta gcg ggc ggg ctg; tgg tgg ccg tac egg ate gag ccg gtc gcg ctg 
30 Val Ala Gly Gly Leu Trp Trp Pro Tyr Arg lie Glu Pro Val Ala Leu 

45 50 55 



198 



35 



gee cag gee tgg gcg ctg cgt tec ctg gac gtg tac gag gag ctg gcg 
Ala Gin Ala Trp Ala Leu Arg Ser Leu Asp Val Tyr Glu Glu Leu Ala 

60 65 70 



24 6 



gca egg ccc ggg cag ace ggc gta cgc atg etc gaa ggg gtg etc ggc 29 4 

Ala Arg Pro Gly Gin Thr Gly Val Arg Met Leu Glu Gly Val Leu Gly 
75 80 85 

40 

gag ace ggc ctg gac gag gtg gac ggg tgg gee gcg gee egg ctg ccg 34 2 

Glu Thr Gly Leu Asp Glu Val Asp Gly Trp Ala Ala Ala Arg Leu Pro 
90 95 100 



45 ggg ctg cgc gcg gcg age gec gec gag tac gee ggg acg ggg ctg tgg 



3&0 
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Gly Leu Arg Ala Ala Ser Ala Ala Glu 
105 110 

gcg egg ctg ccg etc ate gac atg teg 
5 Ala Arg Leu Pro Leu lie Asp Met Ser 

125 

gag egg ctg ctg gee gcg ggc ggc acg 
Glu Arg Leu Leu Ala Ala Gly Gly Thr 
10 140 145 



PCT/EP2005/002734 

Tyr Ala Gly Thr Gly Leu Trp 
115 . 120 

acc cat ctg ccg tgg ctg egg 438 
Thr His. Leu Pro Trp Leu Arg 
130 135 

gtg gag gac cgc gcg gtg acc 48 6 

Val Glu Asp Arg Ala Val Thr 

150 



gat ctg gec gag gcg 
Asp Leu Ala Glu Ala 
155 

15 

ggc gee egg gag ctg 
Gly Ala Arg Glu Leu 
170 

20 cag ctg gtc. gtc gtg 
Gin Leu Val Val Val 
185 

gee gac gcg gac tec 
25 Ala Asp Ala Asp Ser 

205 

egg etc ctg ctg ggc 
Arg Leu Leu Leu Gly 
30 220 

ccg gac ccg gag gtc 
Pro Asp Pro Glu Val 
235 

35 

cgt ccc gag ate gee 
Arg Pro Glu lie Ala 
250 



gac gcg ccg gtg gtg gtc aac 
Asp Ala Pro Val Val Val Asn 
160 

gtg ccg gac ccg gcg gta egg 
Val Pro Asp Pro Ala Val Arg 
175 180 

gag aac ccc ggc ate cac aac 
Glu- Asn Pro Gly lie. His Asn 
19-0 . 195 

ggg gag acg acg tac ttc ctt 
Gly Glu Thr Thr Tyr Phe Leu 

210 

ggc acg get gag gag gac gee 
Gly Thr Ala Glu Glu Asp Ala 

225 

gcg gcg gee ate gtg cga egg 
Ala Ala Ala lie Val Arg Arg 
240 

gga gcg egg gtg etc gcg cac 
Gly Ala Arg Val Leu Ala His 
255 260 



tgc acc ggc ctg 534 

Cys Thr Gly Leu 

165 

ccg gtg cgc gga 582 
Pro Val Arg Gly 



tgg ctg gtc gcg 630 
Trp Leu Val Ala 

' 200 

ccg cag ccg gga 67 8 

Pro Gin Pro Gly 
215 

tgg teg acc gag 72 6 

Trp Ser Thr Glu 
230 . 

tgc gcg gee ctg 774 

Cys Ala Ala Leu 

245 

ctg gtg ggg ctg 822 
Leu Val Gly Leu 



egg ccg gee egg gac gcg gtc egg ctg gag cgc ggg acg ctg ccg gac 87 0 

Arg Pro Ala Arg Asp Ala Val Arg Leu Glu Arg Gly Thr Leu Pro Asp 
265 270 275 280 



ggg cgc egg ctg gtg cac aac tac ggt cac ggc ggc gcg ggc gtc acc 
45 Gly Arg Arg Leu Val His Asn Tyr Gly His Gly Gly Ala Gly Val Thr 



918 
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285 290 295 

gtg gcc tgg ggc tgc get cag gag gcg gec egg etc gec tcctga 963 
Val Ala Trp Gly Cys Ala Gin Glu Ala Ala Arg Leu Ala 
5 300 305 



<210> 12 

<211> 309 

10 <212> PRT 

<213> Streptomyces coelicolor 

<220> 

<221> misc_f eature 

15 <222> (880) . . (936) 

<223> DAAO signature 

<400> 12 

Gly Glu Val Val Val Val Gly Gly Gly Val He Gly Leu Thr Thr Ala 
20 1 5 • 10 15 

Val Val Leu Ala Glu Arg Gly Arg Arg Val Arg Leu Trp Thr Arg Glu 

20 25 30 

25 Pro Ala Glu Arg Thr Thr Ser Val Val Ala Gly Gly Leu Trp Trp Pro 

35 40 . 45 

Tyr Arg He Glu Pro Val Ala Leu Ala Gin Ala Trp Ala Leu Arg Ser 
50 55 60 



30 



Leu Asp Val Tyr Glu Glu Leu Ala Ala Arg Pro Gly Gin Thr Gly Val 
65 70 75 80 



Arg Met Leu Glu Gly Val Leu Gly Glu Thr Gly Leu Asp Glu Val Asp 
35 85 90 95 

Gly Trp Ala Ala Ala Arg Leu Pro Gly Leu Arg Ala Ala Ser Ala Ala 

100 105 110 

40 Glu Tyr Ala Gly Thr Gly Leu Trp Ala Arg Leu Pro Leu He Asp Met 

115 120 125 

Ser Thr His Leu Pro Trp Leu Arg Glu Arg Leu Leu Ala Ala Gly Gly 
130 135 ' 140 

45 



WO 2005/090581 PCT/EP2005/002734 

24/40 

Thr Val Glu Asp Arg Ala Val Thr Asp Leu Ala Glu Ala Asp Ala Pro 
145 150 ' 155 160 

Val Val Val Asn Cys Thr Gly Leu Gly Ala Arg Glu Leu Val Pro Asp 
5 165 170 175 

* 

Pro Ala Val Arg Pro Val Arg Gly Gin Leu Val Val Val Glu Asn Pro 

180 185 190 

10 Gly lie His Asn Trp Leu Val Ala Ala Asp Ala Asp Ser Gly Glu Thr 

195 200 20S 

Thr Tyr Phe Leu Pro Gin Pro Gly Arg Leu Leu Leu Gly^ Gly Thr Ala 
210 215 220 



15 



30 



Glu Glu Asp Ala Trp Ser Thr Glu Pro Asp Pro Glu VaL Ala Ala Ala 
225 230 235 240 



lie Val Arg Arg Cys Ala Ala Leu Arg Pro Glu lie Ala Gly Ala Arg 
20 245 250 255 

Val Leu Ala His Leu Val.- Gly Leu Arg Pro Ala Arg Aspz> Ala Val Arg 

260 ^265 " 270 

25 Leu Glu Arg Gly Thr Leu Pro Asp Gly Arg Arg Leu Val His Asn Tyr 

275 280 285 

Gly His Gly Gly Ala Gly Val Thr Val Ala Trp Gly Cys Ala Gin Glu 
290 . 295 300 



Ala Ala Arg Leu Ala 
305 



<210> 13 

35 <211> 1038 

<212> DNA 

<213> Candida boidinii 

<220> 

40 <221> CDS 

<222> (1) . . (1035) 

<223> coding for DAAO 



45 
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<400> 13 

atg ggt gat caa att gtt .gtt ctt ggt tec ggt att att ggt tta tat 48 

Met Gly Asp Gin lie Val Val Leu Gly Ser Gly lie lie' Gly Leu Tyr 

15 10 15 

5 

act aca tac tgt tta ate tat gag get gga tgt get cca get aaa att 96 

Thr Thr Tyr Cys Leu lie Tyr Glu Ala Gly Cys Ala Pro Ala Lys lie 

20 25 30 

10 act att gtt get gaa ttt tta cca ggt gat caa tct aca tta tat aca 14 4 

Thr lie Val Ala Glu Phe Leu Pro Gly Asp Gin Ser Thr Leu Tyr Thr 
35 40 45 

tct cca tgg gca ggt ggt aat ttt tct tgt att tea cca get gat gat 192 

15 Ser Pro Trp Ala Gly Gly Asn Phe Ser Cys lie Ser Pro Ala Asp Asp 

50 55 60 

aca aca ttg get tat gat aaa ttc aca tat ctt aat tta ttc aag att 240 

Thr Thr Leu Ala Tyr Asp Lys Phe Thr Tyr Leu Asn Leu Phe Lys lie 
20 65 70 • 75 80 

cac aaa aaa tta ggt gga cca gaa tgt gga tta gat aat aag cca agt 2 88 

His Lys Lys Leu Gly Gly Pro Glu Cys Gly Leu Asp Asn Lys Pro Ser 

'85 90 95 

25 

act gaa tat tgg gat ttt tat cct ggt gat gaa aaa gtc aat tct tta 336 

Thr Glu Tyr Trp Asp Phe Tyr Pro Gly Asp Glu Lys Val Asn Ser Leu 

100 105 , 110 

30 aaa caa tat ctt aaa gat ttt aaa gtt att cca aaa tea gaa tta cca 384 

Lys Gin Tyr Leu Lys Asp Phe Lys Val lie Pro Lys Ser Glu Leu Pro 
115 120 125 

gaa ggt gtt gaa tat ggt att agt tat act aca tgg aat ttc aac tgt 432 

35 Glu Gly Val Glu Tyr Gly lie Ser Tyr Thr Thr Trp Asn Phe Asn Cys 

130 135 140 

cct gtt ttc tta caa aat atg get aat ttt tta aat aaa aga aat gtt 4 80 

Pro Val- Phe Leu Gin Asn Met Ala Asn Phe Leu Asn Lys Arg Asn Val 
40 145 150 155 160 

acc att att aga aaa cat tta aca cat att tct caa get tat tta aca 528 

Thr lie He Arg Lys His Leu Thr His lie Ser Gin Ala Tyr Leu Thr 

165 ' 170 ' 175 * 

45 
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gtt aat aca aaa gtt gtt ttc aac tgtaca ggt att ggt get get gat 57 6 

Val Asn Thr Lys Val Val Phe Asn Cys Thr Gly lie Gly Ala Ala Asp 

180 185 190 

5 tta ggt ggt gtt aaa gat gaa aaa gtt tat cca act aga gga caa gtt 624 

Leu Gly Gly Val Lys Asp Glu Lys Val Tyr Pro Thr Arg Gly Gin Val 
195 200 205 

* 

gtt gtt gtt aga get cca cat att caa gaa • aat aaa atg aga tgg ggt 672 

10 Val Val Val Arg Ala Pro His lie Gin Glu Asn Lys Met Arg Trp Gly 

210 215 220 

aaa gac tat get act tat att att cca aga cca tat tct aat ggt gaa 720 
Lys Asp Tyr Ala Thr Tyr lie lie Pro Arg Pro Tyr Ser Asn Gly Glu 
15 225 • 230 235 240 

tta gtc tta ggt ggt ttc tta caa aag gat aat tgg aca ggt aat act 7 68 

Leu Val Leu Gly Gly Phe Leu Gin -Lys Asp Asn Trp Thr Gly Asn Thr 

245 ■ ' • 250 255 

20 

ttt ggt ttt gaa act gat gat att gtt agt aga act aca tct tta tta 816 

Phe Gly Phe Glu Thr Asp Asp,. lie Val. Ser Arg. Thr Thr Ser Leu Leu 

260 . 265 270 

25 cca aag att tta gat gaa cca ctt cat att att aga gtt gca get ggt 8 64 

Pro Lys lie Leu Asp Glu Pro Leu His lie lie Arg Val Ala Ala Gly 
275 280 285 

tta aga cca agt aga cat ggt ggt cca aga att gaa get gaa gtt tgt 912 
30 Leu Arg Pro Ser Arg His Gly Gly Pro Arg He Glu Ala Glu Val Cys 

290 • 295 300 

gaa gaa ggt aaa tta act att cat aat tat ggt get tct gga tat ggt 960 
Glu Glu Gly Lys Leu Thr He His Asn Tyr Gly Ala Ser Gly Tyr Gly 
35 305 310 315 320 

tat caa get ggt tat ggt atg tct tat gaa get gtc aaa ctt tta gtt 1008 
Tyr Gin Ala Gly Tyr Gly Met Ser Tyr Glu Ala Val Lys Leu Leu Val 

325 330 335 



40 



gat aac caa aaa gtt aaa get aaa ctt tag 1038 
Asp Asn Gin Lys Val Lys Ala Lys Leu 

340 345 
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<210> 14 
<211> 345 
<212> PRT 

<213> Candida boidinii 

5 

<400> 14 

Met Gly Asp Gin lie Val Val Leu Gly Ser Gly lie lie Gly Leu Tyr 
15 10 15 

10 Thr Thr Tyr Cys Leu He Tyr Glu Ala Gly Cys Ala Pro Ala Lys He 

20 25 30 

Thr He Val Ala Glu Phe Leu Pro Gly Asp Gin Ser Thr Leu Tyr Thr 
35 40 45 

15 

Ser Pro Trp Ala Gly Gly Asn Phe Ser Cys lie Ser Pro Ala Asp Asp 
50 55 60 

Thr Thr Leu Ala Tyr Asp Lys Phe Thr Tyr Leu Asn Leu Phe Lys lie 
20 65 70 . 75 80 

His Lys Lys Leu Gly Gly Pro Glu Cys Gly Leu Asp Asn Lys . Pro Ser 

85 . 90 . 95 

25 Thr Glu Tyr Trp Asp Phe Tyr Pro Gly Asp Glu Lys Val Asn Ser Leu 

100 105 110 

Lys Gin Tyr Leu Lys Asp Phe Lys Val He Pro Lys Ser Glu Leu Pro 
115 120 125 

30 

Glu Gly Val Glu Tyr Gly He Ser Tyr Thr Thr Trp Asn Phe Asn Cys 
130 135 140 

Pro Val Phe Leu Gin Asn Met Ala Asn Phe Leu Asn Lys Arg Asn Val 
35 145 150 155 160 

Thr He He Arg Lys His Leu Thr His He Ser Gin Ala Tyr Leu Thr 

165 170 175 

40 Val Asn Thr Lys Val Val Phe Asn Cys Thr Gly He Gly Ala Ala Asp 

180 185 190 



Leu Gly Gly Val Lys Asp Glu Lys Val Tyr Pro Thr Arg Gly Gin Val 
195 200 205 

45 



WO 2005/090581 



28/40 



PCT/EP2005/002734 



Val Val Val Arg Ala Pro His He Gin Glu Asn Lys Met Arg Trp Gly 

210 215 220 

Lys Asp Tyr Ala Thr Tyr lie He Pro Arg Pro Tyr Ser Asn Gly Glu 
5 225 230 235 240 



10 



25 



Leu Val Leu Gly Gly Phe Leu Gin Lys Asp Asn Trp Thr Gly Asn Thr 

245 250 255 

Phe Gly Phe Glu Thr Asp Asp He Val Ser Arg Thr Thr Ser Leu Leu 

260 265 270 



Pro Lys He Leu Asp Glu Pro Leu His lie He Arg Val Ala Ala Gly 
15 275 280 285 

Leu Arg Pro Ser Arg His Gly Gly Pro Arg He Glu Ala Glu Val Cys 
290 295 300 

20 Glu Glu Gly Lys Leu Thr He His Asn Tyr Gly Ala Ser Gly Tyr Gly 
305 310 315 320 

Tyr Gin Ala Gly Tyr Gly Met Ser Tyr Glu Ala Val Lys Leu Leu Val 

325 330 335 



Asp Asn Gin Lys Val Lys Ala Lys Leu 

340 345 



30 <210> 15 

• <211> 12466 

<212> DNA 

<213> vector ciaaoScelTetON 

35 <220> 

<221> misc_f eature 

<222> (38) . . (183) 

<223> Agrobacterium right border 

40 <220> 

<221> misc__f eature 

<222> (445) . . (462) 

<223> recognition / cleavage site for I-Scel endonuclease 
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35S terminator 



10 



15 



20 



<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 



mis cofeature 
(515) . . (1222) 
complementary 



coding for I-Scel endonuclease 



promoter 
(1270) (1660) 

complementary: coding for pTOPlOP teracyclin regulatable promoter 



terminator 
(1735) . . (1990) 

complementary: NOS terminator 



misc_feature 
(2067) . . (3173) 
complementary : 



coding for Rhodotorula gracilis D-amino acid oxida 



25 



se 



30 



<220> 
<221> 
<222> 
<223> 



promoter 
(3217) . . (5028) 

complementary: Arabidopsis thaliana nitrilase I promoter 



35 



<220> 
<221> 
<222> 
<223> 



terminator 
(5118) . . (5343) 

complementary: OCS terminator 



40 



<220> 
<221> 
<222> 
<223> 



misc_f eature 
(5418) . . (6425) 

complementary: coding for tetracyclin repressor rtTA 



45 



<220> 
<221> 
<222> 
<223> 



promoter 
(6*479) . . (7341) 

complementary: coding for Pisum sativum ptxA promoter 
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<220> 
<221> 
<222> 
<223> 



terminator 
(7345)-. . (7549) 
complementary : 
sequence) 



coding for 23S terminator (functioning as homology 



10 



<220> 
<221> 
<222> 
<223> 



mis cofeature 
(7618) . . (7834) 
Agrobacterium left border 



<400> 15 

aatattcaaa 

aaaccttttc 

15 tttacccgcc 
atcagatctg 
aagtatttta 
aaccctataa 
aatagagaga 

20 tgtctcacga 
tgatggatat 
tagtgttcgg 
agatcaggta 
aacagttcag 

25 tgaaagactg 
taccaccatc 
ggttgttcgg 
cttggtgttt 
gttctttttt 

30 tgtatgcttt 
ggatgtaagc 
cgatcagctg 
ggtttttttt 
cgttactagc 

35 cgaagactag 
aagtcgagtt 
cctatcagtg 
aagtgaaagt 
ccactcccta 

40 gagaaaagtg 
tagtggatca 
tctagtaaca 
ttttctatcg 
aataacgtca 

45 atgataatca 



caaacacata 
acgccctttt 
aatatatcct 
gtacccggtc 
caaatacaaa 
gaaccctaat 
gatagatttg 
cgttttgaac 
ctgcagaatt 
cagtttgtac 
agacatagaa 
ttggaatttg 
ggtgttcagt 
atccatgaac 
gatggttttt 
gaaagtctgg 
gtgcggcggg 
gtttttccac 
atcacccagg 
ggatttgtat 
gatgtttttc 
ccgggctcga 
atcactctat 
taccactccc 
atagagaaaa 
cgagtttacc 
tcagtgatag 
aaagtcgagc 
attcactggc 
tagatgacac 
cgtattaaat 
tgcattacat 
tcgcaagacc 



cagcgcgact 
aaatatccga 
gtcaaacact 
actggatttt 
tacatactaa 
tcccttatct 
tagagagaga 
ccagattacc 
cgccctttta 
atcatctgcg 
tcgatgtaga 
ttacgcagac 
acgatcgatt 
cagtatgcca 
ttgttgttaa 
gcgccccagg 
gacagtaccc 
tcgaactgca 
atcagaccga 
tctttcagca 
atatggtcga 
gcaaatgtct 
ctcgagttta 
tatcagtgat 
gtgaaagtcg 
actccctatc 
agaaaatgaa 
tcggtaccga 
cgtcgtttta 
cgcgcgcgat 
gtataattgc 
gttaattatt 
ggcaacagga 



tatcatggac 
ttattctaat 
gatagtttaa 
ggttttagga 
gggtttctta 
gggaactact 
ctggtgattt 
ctgttatccc 
gatcttattt 
ggatcaggta 
tgatcggttt 
ccttaaccag 
tgttggtaga 
gagacatcgg 
cgatgaacag 
tgattaccag 
actgatcgta 
tacagtaggt 
tacctgcttc 
gtttagagtt 
ctaaagggcg 
agaaaggcct 
ccactcccta 
agagaaaagt 
agtttaccac 
agtgatagag 
agtcgagttt 
gctcgaattc 
caacgactca 
aatttatcct 
gggactctaa 
acatgcttaa 
ttcaatctta 



atacaaatgg 
aaacgctctt 
actgaaggcg 
attagaaatt 
tatgctcaac 
cacacattat 
ttgcgccggg 
tagtcgagcg 
caggaaagtt 
cggtttgatc 
gtttttgttg 
gtattctact 
gtttttgttg 
ggtcaggtag 
gctagccagt 
gttacccagg 
cagcagacat 
tttaccttca 
gaactgttcg 
cggacccagg 
aattccagca 
tatatacgta 
tcagtgatag 
gaaagtcgag 
tccctatccg 
aaaagtgaaa 
accactccct 
agcacactgg 
gagcttgaca 
agtttgcgcg 
tcataaaaac 
cgtaattca'a 
agaaacttta 



acgaacggat 60 

ttctcttagg 120 

ggaaacgaca 18 0 

ttattgatag 240 

acatgagcga 300 

tctggagaaa 360 

taccccaaac 420 

gccgccagtg 48 0 

tcggaggaga 54 0 

aggttgtaga 600 

atttttacgt 660 

tcttcgaaag 720 

taatcccatt 78 0 

ttttcaacca 84 0 

ttgttgaaag 900 

tggttaacac 960 

acgtggtcca 1020 

tcacgagaac 108 0 

atgttcagtt 114 0 

ttcattacct 1200 

cactggcggc 1260 

aagggtcttg 1320 

agaaaagtga 138 0 

tttaccactc 1440 

tgatagagaa 1500 

gtcgagttta 1560 

atcagtgata 1620 

cggccgttac 1680 

ggaggcccga 174 0 

ctatattttg 1800 

ccatctcata 1860 

cagaaattat 1920 

ttgccaaatg 198 0 
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tttgaacgat cggggatcat ccgggtctgt 
agcaagatct agagcttggg tcccgcctac 
cgctggaacg cctcgtcgac gagctgcgcg 
tatcccgcac tcgagaagcc atacgcatgc 
5 cgtgcgctgc ccctgccgag cgagaggggc 
cgttctgcct caacgcgggg tccgcctcgt 
aggacctcga tgccttcgat cgttccgtcg 
tgcttgagga tccgctggac cgtctctggg 
tacgtcccgc cgcagatgac ttcgccacct 

10 gggtcggacg agtccatcgt gcatcgcttg 
cggattggct cggcggcttg gtcgtcgatg 
gcgttgacca ccaaatccgc accgtcgaac 
tcaaacgtcg cgccgagctt ctgcagctct 
tggacggaga gggtgtcgta ggttacgccg 

15 agggggcggt aatttggcgt gatgtccttg 
tgcgcgaacc gcctcgtccc cttgagccac 
cacttcttga aagtcgattc ttcccatttt 
aaaggcgtcc aattcgcgcc agcccatggt 
ggcaagtcgc gcgcgagaat atgcacgctg 

20 ctcagaccga taacgcctga tccgaggaca 
cgactagagt cgagatccga- tatcgcccgg 
tgacactcag agacttgaga gaagcaatat 
cacgtttatt ttcctattgg agacggtaac 
aggtgggact agcccacgtg gtttcttttc 

25 aaagttcaca tcaaacctac taataattga 
accagccgat gctttatgtc tgaatataaa 
aacggtggtt tacaaagtat ttggacacaa 
aactaccaaa taaaaaacaa ataaaaaact 
agaatttatt cttttatgga ttccttaaat 

30 taggtaataa tatatactat gttatagata 
ttggtcgtac gattctacct aataatattg 
agtatttttt ttttggtttg tcttagctgt 
tgtcataaat aaaatttgac gtatagattt 
gaaatgtgag tttggctcct atttaatctt 

35 gaattatctc agagaaactg gattaaataa 
ggtcaaatat ctttggattt actattattg 
attcctacaa gagaaatcaa aatttttgaa 
ccaattagtt cgcatcaaga aaaccaaacc 
atatttcatt gcagctttca gataaaagaa 

40 ctttcttact tgcaggaact caaattccac 
attgtcaatg aaacgaaata gtctttttat 
tttctgattg tgacagccat ccatatatat 
acatatacgt aatggtttag catagcttct 
acatttttaa gaaaaataaa ttaatatatg 

45 acagtaactt aattattaaa cattctaaat 



ggcgggaact 


ccacgaaaat 


atccgaacgc 


2040 


aacttcgact 


cccgcgccgc 


gccgtggtac 


2100 


acatcctccg 


ccgcgcccca 


actctgctgg 


2160 


acaagcgtga 


cctccttctc 


cttcgccgct 


2220 


gactttgtcc 


ggtcgagagg 


caggacgatc 


2280 


cgtgcaggtc 


gcaagccgac 


gttgtggcgg 


2340 


ctcgagatgg 


tcgggtcgag 


gcgcaagcag 


2400 


ttgacagaca 


agtcccagtc 


tcccacgccg 


2460 


ggtcggggaa 


tgatgtaggc 


gggagaagcg 


2520 


catggggact 


tgacgaggac 


ggtttgcccg 


2580 


cccgcaatcg 


acttggcgcc 


aagtcccgta 


2640 


gcctgctcaa 


gcgacgtaac 


ggtccgtctc 


2700 


cttgcaaggt 


actggcagta 


ctttggtgcg 


2760 


atagcgccag 


gtggacattc 


ggaagatggg 


2820 


taccagtgcc 


cgagcaagcc 


gtcttcgttc 


2880 


atggcatggc 


ccgtcgggac 


caactcgacc 


2940 


gcttgtcgag 


gaccgtctgt 


aagcgtcatg 


3000 


gaagcgaaag 


tctggctcga 


gacgtcctcc 


3060 


tagcccttcc 


gagcgaggat 


gagggcgctg 


3120 


acgacgcgct 


tctgcgagtg 


catgggccct 


3180 


gctcgagtct 


ttgtttttta 


ctttggttca 


3240 


atagactttt 


ttttgttttt 


tttttgtggt 


3300 


gaagatcgaa 


cctgtggtgg 


aaatgaaaca 


3360 


tctgcattga 


tttgtttttg 


ttttttttgt 


3420 


gaagaaaaat 


aaaatctatt 


gattgattaa 


3480 


aaagaagtga 


aaaccccgtt 


taagaattac 


3540 


taaatccaaa 


cgaaataaaa 


caaaatggag 


3600 


taaaagaatt 


tattccattt 


tttttcccgt 


3660 


ccatatttga 


tgcattttga 


ttcctcataa 


3720 


tgtttctaat 


tcgtattaac 


ctaccttttt 


3780 


aacggaattg 


atgttttgga 


ccacttagaa 


3840 


atttcattaa 


atataaattt 


aaataagaaa 


3900 


tttaaatcca 


ttttatgtta 


tttaatattt 


3960 


aggatgggtt 


aatactaagt 


tttccttaat 


4020 


actaaaaaat 


agatcaatgt 


gttttggtcc 


4080 


gcgaaaagaa 


agtctcatat 


agtaaatcat 


4140 


ttaacatgga 


ttgtatagtt 


tcttatataa 


4200 


ccaattaata 


atcaaacggg 


cttggtagga 


4260 


aaaaacacac 


actcaagtct 


tttatttcat 


4320 


tttgccactt 


ttctttacaa 


ataaacacaa 


4380 


gcaaacactg 


tttgtctttt 


ttcgatcacg 


4440 


agggaatgta 


aaacaacaac 


atgtgaagtc 


4500 


attttcgttg 


tcaatattag 


tcattccaaa 


4560 


tatattcttg 


gaactaatgt 'atgtggaaat 

• 


4620 


gcaaatatgc 


aaagaaaaaa 


aagaaaagaa 


4680 
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cacaactgaa atcaaagcca gattcataat 
aacacaacat ccagaattga acactcaaat 
agagaattgg ttctcttcca ttattaacga 
ttgaaactca acataacatt ttgaacttga 
5 atcacaaccg attacttgtt tgagtcatct 
accggtttct tcgggtgctc atttacggtc 
ggtagggata acagggtaat ctgggttcaa 
aattcgagct cggtaccaat tcccatcttg 
aaataacaag tcaggtatta tagtccaagc 

10 attcagaaat atttcaataa ctgattatat 
gagtgcgata ttatgtgtaa tacataaatt 
atcttgcgcc gggtaccgag ctcggtagca 
ccaggagagt tgttgatcta cccaccgtac 
tgctcaaact cgaagtcggc catatccaga 

15 aatcccggac ccggggaatc cccgtccccc 
tcggcatgcg ccatcgccac gtcctcgccg 
tcggtcgggg gggccgtcga cagtctgcgc 
ggagccgcca gccccgcctc ttcgggggcg 
atggtagacc cgtaattgtt tttcgtacgc 

20 ttaagttgtt tttctaatcc gcatatgatc 
gcaccttggt gatcaaataa ttcgatagct 
gtaggtgttt ccctttcttc tttagcgac't 
aaagtaaaat gccccacagc gctgagtgca 
tggcataaaa aggctaattg attttcgaga 

25 tctgaatgta cttttgctcc attgcgatga 
ttattgcgta aaaaatcttg ccagctttcc 
ctatctaaca tctcaatggc taaggcgtcg 
tacagtgtag gctgctctac accaagcttc 
attccgacct cattaagcag ctctaatgcg 

30 gacatggtcg atcgactcta gactagtgga 
ttcgaagatt ttagtgtaat gtgtgtgctc 
agaatgagtg atgaggttta tatggtgaaa 
tctttctcgt gagtcatatt cacggaccat 
tttttatgtt aaatcattga ttgattttta 

35 atgaattcaa acaaactagg ccagaaaaaa 
aatttgttta tttttttcat ggataagggg 
gacatttttt taacatgtag tgacaagtag 
aatcttaatc tttttttcat aaatctgcac 
gtgctaaact attggtaatt tctttttgta 

40 gaaatgtgtc atcctaaaaa caacattttg 
aatggaagaa gcaatatgtt gatgtttatt 
taattgacaa ctatgatttt atattggatt 
ggtttcaaag catgttatgc tagtgattct 
tccttgaatg gagggacaaa ctttgaatga 

45 ctcagcttca caaaaaattg cggacggtca 



aattggctac 


atggttgtag 


aatgtagggt 


4740 


tggatgatag 


atggataatc 


tttagataca 


4800 


aaataaagaa 


aaaaagttta 


gcataaaagt 


4860 


ctccttcata 


ggagtgacat 


gaactgacga 


4920 


tccgctttct 


ccaccttcga 


aatgaatgtg 


4980 


aagtgtaaaa 


catctggtct 


cgaggtacct 


5040 


aacgtcgtga 


gacagtttgg 


tgcaggtcga 


5100 


aaagaaatat 


agtttaaata 


tttattgata 


5160 


aaaaacataa 


atttattgat 


gcaagtttaa 


5220 


cagctggtac 


attgccgtag 


atgaaagact 


5280 


gatgatatag 


ctagcttagc 


tcatcggggg 


5340 


attcccgagg 


ctgtagccga 


cgatggtgcg 


5400 


tcgtcaattc 


caagggcatc 


ggtaaacatc 


5460 


gcgccgtagg 


gggcggagtc 


gtggggggta 


5520 


aacatgtcca 


gatcgaaatc 


gtctagcgcg 


5580 


tctaagtgga 


gctcgtcccc 


caggctgaca 


5640 


gtgtgtcccg 


cggggagaaa 


ggacaggcgc 


5700 


tcgtcgtccg 


ggagatcgag 


caggccctcg 


5760 


gcgcggctgt 


acgcggaccc 


actttcacat 


5820 


aattcaaggc 


cgaataagaa 


ggctggctct 


5880 


tgtcgtaata 


atggcggcat 


actatcagta 


5940 


tgatgctctt 


gatcttccaa 


tacgcaacct 


6000 


tataacgcgt 


tctctagtga 


aaaaccttgt 


6060 


gtttcatact 


gtttttctgt 


aggccgtgta 


6120 


cttagtaaag 


cacatctaaa 


acttttagcg 


6180 


ccttttaaag 


ggcaaaagtg 


agtatggtgc 


6240 


agcaaagccc 


gcttattttt 


tacatgccaa 


6300 


tgggcgagtt 


tacgggttgt 


taaaccttcg 


6360 


ctgttaatca 


ctttactttt 


atctaatcta 


6420 


tccgatatcg 


cccgggctcg 


actctagagt 


6480 


actactatga 


agctttgcac 


ttaaaaaaat 


6540 


aaaactatga 


aattttgata 


ttttgatata 


6600 


gttgcagcaa 


attggaatta 


aactattcat 


. 6660 


gtgggcctcg 


ttacatattc 


aagagttaga 


6720 


ggatgtgggg 


ccattttttt 


gtgtcttaag 


6780 


aatcaatgga 


aaaagtttga 


tgtactagag 


6840 


tgctattatt 


cgacccgtga 


tgaaaggggc 


6900 


atgtgatgct 


ttaattatgc 


tttagacttt 


6960 


atcgaatcaa 


gtatctttta 


aactatgtat 


7020 


ctagttttag 


actttgatgt 


ttatatgctt 


7080 


gggtaaaaga 


aagggacttg 


attgagtatg 


7140 


tgatattcct 


aacattaatt 


taagtgtgtg 


7200 


tgtgtttgat 


gcttgaaaaa 


tctacattca 


7260 


cttttgaata 


ggtgtaaaat 


ccaatcctcc 


7320 


ctggattttg 


gttttaggaa 


ttagaaattt 


7380 
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tattgataga agtattttac aaatacaaat acatactaag ggtttcttat atgctcaaca 7440 

catgagcgaa accctataag aaccctaatt cccttatctg ggaactactc acacattatt 7500 

ctggagaaaa atagagagag atagatttgt agagagagac tggtgatttt ccgggggatc 7560 

ctctagagtc gaggtaccga gctcgaattc actggccgtc gttttacaac gactcagtac 7 620 

5 tgcttggtaa taattgtcat tagattgttt ttatgcatag atgcactcga aatcagccaa 7 680 

ttttagacaa gtatcaaacg gatgttaatt cagtacatta aagacgtccg caatgtgtta 7740 

ttaagttgtc taagcgtcaa tttgtttaca ccacaatata tcctgccacc agccagccaa 7800 

cagctccccg accggcagct cggcacaaaa tcaccacgcg tctaaaaagg tgatgtgtat 7 8 60 

ttgagtaaaa cagcttgcgt catgcggtcg ctgcgtatat gatgcgatga gtaaataaac 7 920 

10 aaatacgcaa ggggaacgca tgaaggttat cgctgtactt aaccagaaag gcgggtcagg 7 98 0 

caagacgacc atcgcaaccc atctagcccg cgccctgcaa ctcgccgggg ccgatgttct 8 04 0 

gttagtcgat tccgatcccc agggcagtgc ccgcgattgg gcggccgtgc gggaagatca 8100 

accgctaacc gttgtcggca tcgaccgccc gacgattgac cgcgacgtga aggccatcgg 8160 

ccggcgcgac ttcgtagtga tcgacggagc gccccaggcg gcggacttgg ctgtgtccgc 8220 

15 gatcaaggca gccgacttcg tgctgattcc ggtgcagcca agcccttacg acatatgggc. 8280 

caccgccgac ctggtggagc tggttaagca gcgcattgag gtcacggatg gaaggctaca 834 0 

agcggccttt gtcgtgtcgc gggcgatcaa aggcacgcgc atcggcggtg aggttgccga 8 400 

ggcgctggcc gggtacgagc tgcccattct tgagtcccgt atcacgcagc gcgtgagcta 8 4 60 

cccaggcact gccgccgccg gcacaaccgt tcttgaatca gaacccgagg gcgacgctgc 8520 

20 ccgcgaggtc caggcgctgg ccgctgaaat taaatcaaaa ctcatttgag ttaatgaggt 858 0 

aaagagaaaa tgagcaaaag cacaaacacg ctaagtgccg gccgtccgag cgcacgcagc 8 64 0 

agcaaggctg caacgttggc cagcctggca gacacgccag. ccatgaagcg ggtcaacttt 8700 

cagttgccgg cggaggatca caccaagctg aagatgtacg cggtacgcca aggcaagacc 87 60 

attaccgagc tgctatctga atacatcgcg cagctaccag agtaaatgag caaatgaata 8 820 

25 aatgagtaga tgaattttag cggctaaagg aggcggcatg gaaaatcaag aacaaccagg 888 0 

caccgacgcc gtggaatgcc ccatgtgtgg aggaacgggc ggttggccag gcgtaagcgg 8 94 0 

ctgggttgtc tgccggccct gcaatggcac tggaaccccc aagcccgagg aatcggcgtg 9000 

agcggtcgca aaccatccgg cccggtacaa atcggcgcgg cgctgggtga tgacctggtg 90 60 

gagaagttga aggccgcgca ggccgcccag cggcaacgca tcgaggcaga agacgccccg 9120 

30 gtgaatcgtg gcaaggggcc gctgatcgaa tccgcaaaga atcccggcaa ■ ccgccggcag 9180 

ccggtgcgcc gtcgattagg aagccgccca agggcgacga gcaaccagat tttttcgttc 924 0 

cgatgctcta tgacgtgggc acccgcgata gtcgcagcat catggacgtg gccgttttcc 9300 

gtctgtcgaa gcgtgaccga cgagctggcg aggtgatccg ctacgagctt ccagacgggc 9360 

acgtagaggt ttccgcaggg ccggccggca tggccagtgt gtgggattac gacctggtac 9420 

35 tgatggcggt ttcccatcta accgaatcca tgaaccgata ccgggaaggg aagggagaca 94 8 0 

agcccggccg cgtgttccgt ccacacgttg cggacgtact caagttctgc cggcgagccg 954 0 

atggcggaaa gcagaaagac gacctggtag aaacctgcat tcggttaaac accacgcacg 9600 

ttgccatgca gcgtacgaag aaggccaaga acggccgcct ggtgacggta tccgagggtg 9660 

aagccttgat tagccgctac aagatcgtaa agagcgaaac cgggcggccg gagtacatcg 9720 

40 agatcgagct agctgattgg atgtaccgcg agatcacaga aggcaagaac ccggacgtgc 978 0 

tgacggttca ccccgattac tttttgatcg atcccggcat cggccgtttt ctctaccgcc 984 0 

tggcacgccg cgccgcaggc aaggcagaag ccagatggtt gttcaagacg atctacgaac 9900 

gcagtggcag cgccggagag ttcaagaagt tctgtttcac cgtgcgcaag ctgatcgggt 9960 

caaatgacct gccggagtac gatttgaagg aggaggcggg gcaggctggc ccgatcctag 10020 

45 tcatgcgcta ccgcaacctg atcgagggcg aagcatccgc cggttcctaa tgtacggagc 1008 0 
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agatgctagg gcaaattgcc ctagcagggg 
atagcacgta cattgggaac ccaaagccgt 
acccaaagcc gtacattggg aaccggtcac 
aaggcgattt ttccgcctaa aactctttaa 
5 cctgtgcata actgtctggc cagcgcacag 
ggtcgctgcg ctccctacgc cccgccgctt 
gaaataccgc acagatgcgt aaggagaaaa 
ctcactgact cgctgcgctc ggtcgttcgg 
gcggtaatac ggttatccac agaatcaggg 

10 ggccagcaaa aggccaggaa ccgtaaaaag 
cgcccccctg acgagcatca caaaaatcga 
ggactataaa gataccaggc gtttccccct 
accctgccgc ttaccggata cctgtccgcc 
catagctcac gctgtaggta tctcagttcg 

15 gtgcacgaac cccccgttca gcccgaccgc 
tccaacccgg taagacacga cttatcgcca 
agagcgaggt atgtaggcgg tgctacagag 
actagaagga cagtatttgg tatctgcgct 
gttggtagct cttgatccgg caaacaaacc 

20 aagcagcaga ttacgcgcag aaaaaaagga 
gggtctgacg ctcagtggaa cgaaaactca 
tctcccaatt tgtgtagggc ttattatgca 
tgatagtttg gctgtgagca attatgtgct 
ccgcgaagcg gcgtcggctt gaacgaattt 

25 gtgatctcgc ctttcacgta gtggacaaat 
cgatcttctt cttgtccaag ataagcctgt 
gccggcaggc gctccattgc ccagtcggca 
actgcgctgt accaaatgcg ggacaacgta 
tcgggcggcg agttccatag cgttaaggtt 
" 30 ggaaccggat caaagagttc ctccgccgct 
gcttttgtca gcaa'gatagc cagatcaatg 
agaatgtcat tgcgctgcca ttctccaaat 
ggaatgatgt cgtcgtgcac aacaatggtg 
ccaggggaag ccgaagtttc caaaaggtcg 

35 agccttacgg tcaccgtaac cagcaaatca 
actgcggagc cgtacaaatg tacggccagc 
ccaactacct ctgatagttg agtcgatact 
aactttgttt tagggcgact gccctgctgc 
catcgaccca cggcgtaacg cgcttgctgc 

40 aaaaaacagt cataacaagc catgaaaacc 



aaaaaggtcg aaaaggtctc tttcctgtgg 1014 0 

acattgggaa ccggaacccg tacattggga 10200 

acatgtaagt gactgatata aaagagaaaa 102 60 

aacttattaa aactcttaaa acccgcctgg 10320 

ccgaagagct gcaaaaagcg cctacccttc 1038 0 

cgcgtcggcc tatcgcggcc tatgcggtgt 10440 

taccgcatca ggcgctcttc cgcttcctcg 10500 

ctgcggcgag cggtatcagc tcactcaaag 10560 

gataacgcag gaaagaacat gtgagcaaaa 10620 

gccgcgttgc tggcgttttt ccataggctc 10 68 0 

cgctcaagtc agaggtggcg aaacccgaca 1074 0 

ggaagctccc tcgtgcgctc tcctgttccg 10800 

tttctccctt cgggaagcgt ggcgctttct 108 60 

gtgtaggtcg ttcgctccaa gctgggctgt 10920 

tgcgccttat ccggtaacta tcgtcttgag 1098 0 

ctggcagcag ccactggtaa caggattagc 11040 

ttcttgaagt ggtggcctaa ctacggctac 11100 

ctgctgaagc cagttacctt cggaaaaaga 11160 

accgctggta gcggtggttt ttttgtttgc 11220 

tctcaagaag atcctttgat cttttctacg 11280 

cgttaaggga ttttggtcat gcatgatata 11340 

cgcttaaaaa taataaaagc agacttgacc 11400 

tagtgcatct aacgcttgag ttaagccgcg 114 60 

ctagctagac attatttgcc gactaccttg 11520 

tcttccaact gatctgcgcg cgaggccaag 11580 

ctagcttcaa gtatgacggg ctgatactgg 1164 0 

gcgacatcct tcggcgcgat tttgccggtt 11700 

agcactacat ttcgctcatc gccagcccag 117 60 

tcatttagcg cctcaaatag atcctgttca 11820 

ggacctacca aggcaacgct atgttctctt 11880 

tcgatcgtgg ctggctcgaa gatacctgca 1194 0 

tgcagttcgc gcttagctgg ataacgccac 12000 

acttctacag cgcggagaat ctcgctctct 12060 

ttgatcaaag ctcgccgcgt tgtttcatca 12120 

atatcactgt gtggcttcag gccgccatcc 12180 

aacgtcggtt cgagatggcg ctcgatgacg 1224 0 

tcggcgatca ccgcttcccc catgatgttt 12300 

gtaacatcgt tgctgctcca taacatcaaa 12360 

ttggatgccc gaggcataga ctgtacccca 12420 

gccactgcgt tccatg 124 66 



<210> 16 
<211> 12539 
45 <212> DNA 
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<213> daaoNit-PRecombination 



<220> 
<221> 
<222> 
<223> 



misc_f eature 
(38) . . (183) 

Agrobacterium right border 



10 



<220> 
<221> 
<222> 
<223> 



terminator 
(196) . . (400) 

complementary: 35S terminator 



15 



<220> 
<221> 
<222> 
<223> 



mis cofeature 
(445) . . (462) 

cleavage / recognition site for I-Scel endonuclease 



20 



<220> 
<221> 
<222> 
<223> 



terminator 
(589) . . (844) 

complementary: nos terminator 



25 



<220> 
<221> 
<222> 
<223> 



mis cofeature 
(921) . . (2027) 
complementary 



coding for Rhodotorula gracilis D-amino acid oxida 



se 



30 



35 



40 



<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 



promoter 
(2071) . . (3882) 

complementary: A-thaliana nitrilase I promoter 



terminator 
(3972) . . (4176) 

complementary: 35S terminator 



mis cofeature 
(4251) . . (6248) 

complementary: coding for beta-glucuronidase 



10 
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<220> 

<221> promoter 
<222> (6302) . . (7619) 

<223> complementar. coding for sTPT promoter 
<220> 

<221> misc^feature 

<222> (7691) . . (7907) 

<223> Agrobacterium left border 



<400> 16 

aatattcaaa caaacacata cagcgcgact tatcatggac atacaaatgg acgaacggat 60 

aaaccttttc acgccctttt aaatatccga ttattctaat aaacgctctt ttctcttagg 120 

tttacccgcc aatatatcct gtcaaacact gatagtttaa actgaaggcg ggaaacgaca 180 

15 atcagatctg gtacccggtc actggatttt ggttttagga attagaaatt ttattgatag 240 

aagtatttta caaatacaaa tacatactaa gggtttctta tatgctcaac acatgagcga 300 

aaccctataa gaaccctaat tcccttatct gggaactact cacacattat tctggagaaa 360 

aatagagaga gatagatttg tagagagaga ctggtgattt ttgcgccggg taccccaaac 420 

tgtctcacga cgttttgaac ccagattacc ctgttatccc tagtcgagcg gccgccagtg 48 0 

20 tgatggatat ctgcagaatt cgccctttta gatcagcaca ctggcggccg ttactagtgg 54 0 

atcaattcac tggccgtcgt . tttacaacga ctcagagctt gacaggaggc ccgatctagt 600' 

aacatagatg acaccgcgcg cgataattta tcctagtttg cgcgctatat tttgttttct 660 

atcgcgtatt aaatgtataa ttgcgggact ctaatcataa aaacccatct cataaataac 720 

gtcatgcatt acatgttaat tattacatgc ttaacgtaat tcaacagaaa ttatatgata 780 

25 atcatcgcaa gaccggcaac aggattcaat cttaagaaac tttattgcca aatgtttgaa 84 0 

cgatcgggga tcatccgggt ctgtggcggg aactccacga aaatatccga acgcagcaag 900 

atctagagct tgggtcccgc ctacaacttc gactcccgcg ccgcgccgtg gtaccgctgg 960 

aacgcctcgt cgacgagctg cgcgacatcc tccgccgcgc cccaactctg ctggtatccc 1020 

gcactcgaga agccatacgc atgcacaagc gtgacctcct tctccttcgc cgctcgtgcg 108 0 

30 ctgcccctgc cgagcgagag gggcgacttt gtccggtcga gaggcaggac gatccgttct 114 0 

gcctcaacgc ggggtccgcc tcgtcgtgca ggtcgcaagc cgacgttgtg gcggaggacc 1200 

tcgatgcctt cgatcgttcc gtcgctcgag atggtcgggt cgaggcgcaa gcagtgcttg 1.2 60 

aggatccgct ggaccgtctc tgggttgaca gacaagtccc agtctcccac gccgtacgtc 1320 

ccgccgcaga tgacttcgcc acctggtcgg ggaatgatgt aggcgggaga agcggggtcg 138 0 

35 gacgagtcca tcgtgcatcg cttgcatggg gacttgacga ggacggtttg cccgcggatt 1440 

ggctcggcgg cttggtcgtc gatgcccgca atcgacttgg cgccaagtcc cgtagcgttg 1500 

accaccaaat ccgcaccgtc gaacgcctgc tcaagcgacg taacggtccg tctctcaaac 1560 

gtcgcgccga gcttctgcag ctctcttgca aggtactggc agtactttgg tgcgtggacg 1620 

gagagggtgt cgtaggttac gccgatagcg ccaggtggac attcggaaga tgggaggggg 168 0 

40 cggtaatttg gcgtgatgtc cttgtaccag tgcccgagca ' agccgtcttc gttctgcgcg 1740 

aaccgcctcg tccccttgag ccacatggca tggcccgtcg ggaccaactc gacccacttc 1800 

ttgaaagtcg attcttccca ttttgcttgt cgaggaccgt ctgtaagcgt catgaaaggc 18 60 

gtccaattcg cgccagccca tggtgaagcg aaagtctggc tcgagacgtc ctccggcaag 1920 

tcgcgcgcga gaatatgcac gctgtagccc ttccgagcga ggatgagggc gctgctcaga '198 0 

45 ccgataacgc ctgatccgag gacaacgacg cgcttctgcg agtgcatggg ccctcgacta 2040 
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gagtcgagat ccgatatcgc ccgggctcga gtctttgttt tttactttgg ttcatgacac 2100 

tcagagactt gagagaagca atatatagac ttttttttgt tttttttttg tggtcacgtt 2160 

ta'ttttccta ttggagacgg taacgaagat cgaacctgtg gtggaaatga aacaaggtgg 222 0 

gactagccca cgtggtttct tttctctgca ttgatttgtt tttgtttttt ttgtaaagtt 2280 

5 cacatcaaac ctactaataa ttgagaagaa aaataaaatc tattgattga ttaaaccagc 2340 

cgatgcttta tgtctgaata taaaaaagaa gtgaaaaccc cgtttaagaa ttacaacggt 24 00 

ggtttacaaa gtatttggac acaataaatc caaacgaaat aaaacaaaat ggagaactac 24 60 

caaataaaaa acaaataaaa aacttaaaag aatttattcc attttttttc ccgtagaatt 252 0 

tattctttta tggattcctt aaatccatat ttgatgcatt ttgattcctc ataataggta 2580 

10 ataatatata ctatgttata gatatgtttc taattcgtat taacctacct ttttttggtc 2 64 0 

gtacgattct acctaataat attgaacgga attgatgttt tggaccactt agaaagtatt 27 00 

ttttttttgg tttgtcttag ctgtatttca ttaaatataa atttaaataa gaaatgtcat 27 60 

aaataaaatt tgacgtatag attttttaaa tccattttat gttatttaat atttgaaatg 2820 

tgagtttggc tcctatttaa tcttaggatg ggttaatact aagttttcct taatgaatta 2880 

15 tctcagagaa actggattaa ataaactaaa aaatagatca atgtgttttg gtccggtcaa 2 94 0 

atatctttgg atttactatt attggcgaaa agaaagtctc atatagtaaa tcatattcct 3000 

acaagagaaa tcaaaatttt tgaattaaca tggattgtat agtttcttat ataaccaatt 3060 

agttcgcatc aagaaaacca aaccccaatt aataatcaaa cgggcttggt aggaatattt 312 0 

cattgcagct ttcagataaa agaaaaaaac acacactcaa gtcttttatt tcatctttct 318 0 

20 tacttgcagg aactcaaatt ccactttgcc acttttcttt acaaataaac acaaattgtc 324 0 

aatgaaacga aatagtcttt ttatgcaaac actgtttgtc ttttttcgat cacgtttctg 3300 

attgtgacag ccatccatat atatagggaa tgtaaaacaa' caacatgtga agtcacatat 33 60 

acgtaatggt ttagcatagc ttctattttc gttgtcaata ttagtcattc caaaacattt 3420 

ttaagaaaaa taaattaata tatgtatatt cttggaacta atgtatgtgg aaatacagta 34 8 0 

25 acttaattat taaacattct aaatgcaaat atgcaaagaa aaaaaagaaa agaacacaac 354 0 

tgaaatcaaa gccagattca taataattgg ctacatggtt gtagaatgta gggtaacaca 3600 

acatccagaa ttgaacactc aaattggatg atagatggat aatctttaga tacaagagaa 3660 

ttggttctct tccattatta acgaaaataa agaaaaaaag tttagcataa aagtttgaaa 37 20 

c'tcaacataa cattttgaac ttgactcctt cataggagtg acatgaactg acgaatcaca 37 8 0 

30 accgattact tgtttgagtc atcttccgct • ttctccacct tcgaaatgaa tgtgaccggt 38 4 0 

ttcttcgggt gctcatttac ggtcaagtgt aaaacatctg gtctcgaggt acctggtagg 3900 

gataacaggg taatctgggt tcaaaacgtc gtgagacagt ttggtgcagg tcgaaattcg 3960 

agctcggtac ccggtcactg gattttggtt ttaggaatta gaaattttat tgatagaagt 4020 

attttacaaa tacaaataca tactaagggt ttcttatatg ctcaacacat gagcgaaacc 4080 

35 ctataagaac cctaattccc ttatctggga actactcaca cattattctg gagaaaaata 414 0 

gagagagata gatttgtaga gagagactgg tgatttttgc gccgggtacc gagctcggta 4200 

gcaattcccg aggctgtagc cgacgatggt gcgccaggag agttgttgat tcattgtttg 4260 

cctccctgct gcggtttttc accgaagttc atgccagtcc agcgtttttg cagcagaaaa 4320 

gccgccgact tcggtttgcg gtcgcgagtg aagatccctt tcttgttacc gccaacgcgc 438 0 

40 aatatgcctt gcgaggtcgc aaaatcggcg aaattccata cctgttcacc gacgacggcg 44 4 0 

ctgacgcgat caaagacgcg gtgatacata tccagccatg cacactgata ctcttcactc 4500 

cacatgtcgg tgtacattga gtgcagcccg gctaacgtat ccacgccgta ttcggtgatg 45 60 

ataatcggct gatgcagttt ctcctgccag gccagaagtt ctttttccag taccttctct 4 62 0 

gccgtttcca aatcgccgct ttggac'atac catccgtaat aacggttcag gcacagcaca 4*68 0 

45 tcaaagagat cgctgatggt atcggtgtga gcgtcgcaga acattacatt gacgcaggtg 474 0 
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atcggacgcg tcgggtcgag tttacgcgtt 
acttgcggac gggtatccgg ttcgttggca 
ttgtcacgcg ctatcagctc tttaatcgcc 
actgcctctt cgctgtacag ttctttcggc 
5 gagaggttaa agccgacagc agcagtttca 
cagtcgagca tctcttcagc gtaagggtaa 
cagtccatta atgcgtggtc gtgcaccatc 
tccgcatctt catgacgacc aaagccagta 
tgttcgccct tcactgccac tgaccggatg 

10 gtctggcttt tggctgtgac gcacagttca 
tgcggattca ccacttgcaa agtcccgcta 
tccgcatcac gcagttcaac gctgacatca 
gcgtggttac agtcttgcgc gacatgcgtc 
ggcgtggtgt agagcattac. gctgcgatgg 

15 taagactgct ttttcttgcc gttttcgtcg 
cagttcagtt cgttgttcac acaaacggtg 
tattagaaaa gttataaatt aaaatataca 
tatactacat tcttttattt tgaaaaaaat 
gataattatt atatatatat. caaaggtaga 

20 ataacatacg gcgtgacatc ggcttcaaat 
tcctgattat tgacccacac tttgccgtaa 
cgctggcctg cccaaccttt . cggtataaag 
taattacgaa. tatctgcatc- ggcgaactga 
cggctttctt gtaacgcgct ttcccaccaa 

25 agactgaatg cccacaggcc gtcgagtttt 
cgtaccatgg tcgatcgact ctagactagt 
atgaaatcga aattcagagt tttgatagtg 
tcgagtattt caagagatgg tacttgttga 
aatccagtat ctcgcggtgt atggagttat 

30 gtagagaatg atgtaattag ataagaatct 
gggtccatct tatcttgata agtggatggt 
gaagttccat atcaccatca tcatctgtat 
tgaaattttg gtagatttta tttttcatta 
tattgaacgt gacagattca caagaaagta 

35 taaaggtatc tctatttaat ttcttttgtt 
tttcatgctc atagtcatat acgttattct 
cgtaatacaa ttttataatt ttatcaaagt 
tacaaaacaa aaaaatcttc taattttttt 
aaaaatacaa atgaacccac atgtattatg 

40 gtactcaatt caccctatac tctttttttt 
ttctgtccat ttgaccaaat tcaattaatt 
atgttcacta tttctcgatt tgagggaacc 
atcactacaa atatgttata gtttatactt 
attaagtaaa catgattctt cttatgaagt 

45 gtgattgatg ttattggttg attgagtgat 



gcttccgcca 


gtggcgaaat 


attcccgtgc 


4800 


atactccaca 


tcaccacgct 


tgggtggttt 


4860 


tgtaagtgcg 


cttgctgagt 


ttccccgttg 


4920 


ttgttgcccg 


cttcgaaacc 


aatgcctaaa 


4980 


tcaatcacca 


cgatgccatg ttcatctgcc 


5040 


tgcgaggtac 


ggtaggagtt 


ggccccaatc 


5100 


agcacgttat 


cgaatccttt 


gccacgtaag 


5160 


aagtagaacg 


gtttgtggtt 


aatcaggaac 


5220 


ccgacgcgaa 


gcgggtagat 


atcacactct 


5280 


tagagataac 


cttcacccgg ttgccagagg 


5340 


gtgccttgtc 


cagttgcaac 


cacctgttga 


5400 


ccattggcca 


ccacctgcca 


gtcaacagac 


5460 


accacggtga 


tatcgtccac 


ccaggtgttc 


5520 


attccggcat 


agttaaagaa 


atcatggaag 


5580 


gtaatcacca 


ttcccggcgg 


gatagtctgc 


5640 


atacctgcac 


atcaacaaat 


tttggtcata 


5700 


cacttataaa 


ctacagaaaa 


gcaattgcta 


5760 


atttgaaata 


ttatattact 


actaattaat 


5820 


agcagaaact 


tacgtacact 


tttcccggca 


5880 


ggcgtatagc 


cgccctgatg 


ctccatcact 


5940 


tgagtgaccg 


catcgaaacg 


cagcacgata 


6000 


acttcgcgct 


gataccagac 


gttgcccgca 


6060 


tcgttaaaac 


tgcctggcac 


agcaattgcc 


6120 


. cgctgaccaa 


ttccacagtt 


ttcgcgatcc 


618Q 


ttgatttcac 


gggttggggt 


ttctacagga 


6240 


ggatccgata 


tcgcccgggc 


tcgactctag 


6300 


agagcaaaga 


gggacggact 


tatgaggatt 


6360 


tcggacggct 


acgatgatct 


cgatttggtt 


6420 


ggtagggtta 


atggtcaatt 


tcatctaacg 


6480 


tgagatactg 


gtttagattg 


gatgagtgta 


6540 


ttttagagac 


acagtgaata 


ttagccaatc 


6600 


aattttgttt 


ttttggaaga 


taataatgat 


6660 


tttaccttgt 


atgttgagtg 


gtcttcaaat 


6720 


gattttttat 


aaatgaaatt 


ttacttattt 


6780 


tatggttgtc 


tgtcagcatt 


tgacttgcag 


6840 


aggctttttt 


gaatatctta 


ttactttttt 


6900 


tatacaacta 


taactaaaat 


tagggttttc 


6960 


tgttgtagcc 


agtttactcg 


taagttacaa 


7020 


cgtttaacta 


ggattaccat 


gtactttcat 


7080 


tttttttcta 


gttccaccca 


atctataaaa 


7140 


tctgtaattg 


cgatttaaaa 


ttaatattac 


7200 


cgagtttaaa 


tatgataaaa 


atgttgaccc 


7260 


aatagtggtg 


tttttgggga 


taattgatga 


7320 


tgattgagtg 


attattgtat 


gtaaacctat 


7380 ' 


tattgtatta 


gtatgtaagc 


aaagatgatt 


7440 
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gttcttatga ggtaatttgt tactcattca tccttttgca tatgagaaat tgtgttagcg 7500 

tacgcaaaac aatagagaac ataaaagata tgtgtattta tttaaggtga cttttgttaa 7560 

tgatattgta gtatctatac atttatatat aacttgttga atttgagtat aagctatcag 7 620 

gatccggggg atcctctaga gtcgaggtac cgagctcgaa ttcactggcc gtcgttttac 7 680 

5 aacgactcag tactgcttgg taataattgt cattagattg tttttatgca tagatgcact 774 0 

cgaaatcagc caattttaga caagtatcaa acggatgtta attcagtaca ttaaagacgt 7 800 

ccgcaatgtg ttattaagtt gtctaagcgt caatttgttt acaccacaat atatcctgcc 7 8 60 

accagccagc caacagctcc ccgaccggca gctcggcaca aaatcaccac gcgtctaaaa 7 920 

aggtgatgtg tatttgagta aaacagcttg cgtcatgcgg tcgctgcgta tatgatgcga 7 98 0 

10 tgagtaaata aacaaatacg caaggggaac gcatgaaggt tatcgctgta cttaaccaga 8 04 0 

aaggcgggtc aggcaagacg accatcgcaa cccatctagc ccgcgccctg caactcgccg 8100 

gggccgatgt tctgttagtc gattccgatc cccagggcag tgcccgcgat tgggcggccg 8160 

tgcgggaaga tcaaccgcta accgttgtcg gcatcgaccg cccgacgatt gaccgcgacg 8220 

tgaaggccat cggccggcgc gacttcgtag tgatcgacgg agcgccccag gcggcggact 828 0 

15 . tggctgtgtc cgcgatcaag gcagccgact, tcgtgctgat tccggtgcag ccaagccctt 8340 

acgacatatg ggccaccgcc gacctggtgg agctggttaa gcagcgcatt gaggtcacgg 8400 

atggaaggct acaagcggcc tttgtcgtgt cgcgggcgat caaaggcacg cgcatcggcg 8460 

gtgaggttgc . cgaggcgctg gccgggtacg. agctgcccat tcttgagtcc cgtatcacgc 8520 

agcgcgtgag ctacccaggc actgccgccg ccggcacaac cgttcttgaa tcagaacccg 8580 

20 agggcgacgc tgcccgcgag gtccaggcgc tggccgctga aattaaatca aaactcattt 8 64 0 

gagttaatga ggtaaagaga aaatgagcaa aagcacaaac : acgctaagtg ccggccgtcc 8700 ■ 

gagcgcacgc agcagcaagg ctgcaacgtt ggccagcctg gcagacacgc cagccatgaa 87 60 

gcgggtcaac tttcagttgc cggcggagga tcacaccaag ctgaagatgt .acgcggtacg 8820 

ccaaggcaag accattaccg agctgctatc tgaatacatc gcgcagctac cagagtaaat 8880 

25 gagcaaatga ataaatgagt agatgaattt. tagcggctaa aggaggcggc atggaaaatc 8 94 0 

aagaacaacc aggcaccgac gccgtggaat gccccatgtg tggaggaacg ggcggttggc 9000 

caggcgtaag cggctgggtt gtctgccggc cctgcaatgg cactggaacc cccaagcccg 90 60 

aggaatcggc gtgagcggtc gcaaaccatc cggcccggta caaatcggcg cggcgctggg 9120 

tgatgacctg gtggagaagt tgaaggccgc gcaggccgcc cagcggcaac. gcatcgaggc 9180 

30 agaagacgcc ccggtgaatc gtggcaaggg gccgctgatc gaatccgcaa agaatcccgg 9240 

caaccgccgg cagccggtgc gccgtcgatt aggaagccgc ccaagggcga cgagcaacca 9300 

gattttttcg ttccgatgct ctatgacgtg ggcacccgcg atagtcgcag catcatggac 9360 

gtggccgttt tccgtctgtc gaagcgtgac cgacgagctg gcgaggtgat ccgctacgag 9420 

cttccagacg ggcacgtaga ggtttccgca gggccggccg gcatggccag tgtgtgggat 94 8 0 

35 tacgacctgg tactgatggc ggtttcccat ctaaccgaat ccatgaaccg ataccgggaa 954 0 

gggaagggag acaagcccgg ccgcgtgttc cgtccacacg ttgcggacgt actcaagttc 9600 

tgccggcgag ccgatggcgg aaagcagaaa gacgacctgg tagaaacctg cattcggtta 9660 

aacaccacgc acgttgccat gcagcgtacg aagaaggcca agaacggccg cctggtgacg 9720 

gtatccgagg gtgaagcctt gattagccgc tacaagatcg taaagagcga aaccgggcgg 97 8 0 

40' ccggagtaca tcgagatcga gctagctgat tggatgtacc gcgagatcac agaaggcaag 98 4 0 

aacccggacg tgctgacggt tcaccccgat tactttttga tcgatcccgg catcggccgt 9900 

tttctctacc gcctggcacg ccgcgccgca ggcaaggcag aagccagatg gttgttcaag 9960 

acgatctacg aacgcagtgg cagcgccgga gagttcaaga agttctgttt caccgtgcgc 10020 

aagctgatcg ggtcaaatga cctgccggag tacgatttga aggaggaggc ' ggggcaggct 10080 

45 ggcccgatcc tagtcatgcg ctaccgcaac ctgatcgagg gcgaagcatc cgccggttcc 10140 
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taatgtacgg 
ctctttcctg 
ccgtacattg 
ataaaagaga 
5 ■ aaaacccgcc 
gcgcctaccc 
gcctatgcgg 
ttccgcttcc 
agctcactca 

10 catgtgagca 
tttccatagg 
gcgaaacccg 
ctctcctgtt 
cgtggcgctt 

15 caagctgggc 
ctatcgtctt 
taacaggatt 
taactacggc 
cttcggaaaa 

20 • tttttttgtt 
gatcttttct 
catgcatgat 
agcagacttg 
gagttaagcc 

25 gccgactacc 
gcgcgaggcc 
gggctgatac 
gattttgccg 
atcgccagcc 

30 ta'gatcctgt 
gctatgttct 
gaagatacct 
tggataacgc 
aatctcgctc 

35 cgttgtttca 
caggccgcca 
gcgctcgatg 
ccccatgatg 
ccataacatc 

40 agactgtacc 



agcagatgct 
tggatagcac 
ggaacccaaa 
aaaaaggcga 
tggcctgtgc 
ttcggtcgct 
tgtgaaatac 
tcgctcactg 
aaggcggtaa 
aaaggccagc 
ctccgccccc 
acaggactat 
ccgaccctgc 
tctcatagct 
tgtgtgcacg 
gagtccaacc 
agcagagcga 
tacactagaa 
agagttggta 
tgcaagcagc 
acggggtctg 
atatctccca : 
acctgatagt 
gcgccgcgaa 
ttggtgatct 
aagcgatctt 
tgggccggca 
gttactgcgc 
cagtcgggcg 
tcaggaaccg 
cttgcttttg 
gcaagaatgt 
cacggaatga 
tctccagggg 
tcaagcctta 
t'ccactgcgg 
acgccaacta 
tttaactttg 
aaacatcgac 
ccaaaaaaac 



agggcaaatt 
gtacattggg 
gccgtacatt 
tttttccgcc 
ataactgtct 
gcgctcccta 
cgcacagatg 
actcgctgcg 
tacggttatc 
aaaaggccag 
ctgacgagca 
aaagatacca 
cgcttaccgg 
cacgctgtag 
aaccccccgt 
cggtaagaca 
ggtatgtagg 
ggacagtatt 
gctcttgatc 
agattacgcg 
acgctcagtg 
atttgtgtag 
ttggctgtga 
gcggcgtcgg 
cgcctttcac 
cttcttgtcc 
ggcgctccat 
tgtaccaaat 
gcgagttcca 
gatcaaagag 
tcagcaagat 
cattgcgctg 
tgtcgtcgtg 
aagccgaagt 
cggtcaccgt 
agccgtacaa 
cctctgatag 
ttttagggcg 
ccacggcgta 
agtcataaca 



gccctagcag 
aacccaaagc 
gggaaccggt 
taaaactctt 
ggccagcgca 
cgccccgccg 
cgtaaggaga 
ctcggtcgtt 
cacagaatca 
gaaccgtaaa 
tcacaaaaat 
ggcgtttccc 
atacctgtcc 
gtatctcagt 
tcagcccgac 
cgacttatcg 
cggtgctaca 
tggtatctgc 
cggcaaacaa 
cagaaaaaaa 
gaacgaaaac 
•ggcttattat 
gcaattatgt 
cttgaacgaa 
gtagtggaca 
aagataagcc 
tgcccagtcg 
gcgggacaac 
tagcgttaag 
ttcctccgcc 
agccagatca 
ccattctcca 
cacaacaatg 
ttccaaaagg 
aaccagcaaa 
atgtacggcc 
ttgagtcgat 
actgccctgc 
acgcgcttgc 
agccatgaaa 



gggaaaaagg 
cgtacattgg 
cacacatgta 
taaaacttat 
cagccgaaga 
cttcgcgtcg 
aaataccgca 
cggctgcggc 
ggggataacg 
aaggccgcgt 
cgacgctcaa 
cctggaagct 
gcctttctcc 
tcggtgtagg 
cgctgcgcct 
ccactggcag 
gagttcttga 
gctctgctga 
accaccgctg 
ggatctcaag 
tcacgttaag 
gcacgcttaa 
gcttagtgca 
tttctagcta 
aattcttcca 
tgtctagctt 
gcagcgacat 
gtaagcacta 
gtttcattta 
gctggaccta 
atgtcgatcg 
aattgcagtt 
gtgacttcta 
, tcgttgatca 
tcaatatcac 
agcaacgtcg 
acttcggcga 
tgcgtaacat 
tgcttggatg 
accgccactg 



tcgaaaaggt 


10200 


gaaccggaac 


10260 


agtgactgat 


10320 


taaaactctt 


10380 


gctgcaaaaa 


10440 


gcctatcgcg 


10500 


tcaggcgctc 


10560 


gagcggtatc 


10620 


caggaaagaa 


10680 


tgctggcgtt 


10740 


gtcagaggtg 


10800 


ccctcgtgcg 


10860 


cttcgggaag 


10920 


tcgttcgctc 


10980 


tatccggtaa 


11040 


cagccactgg 


11100 


agtggtggcc 


11160 


agccagttac 


11220 


gtagcggtgg 


11280 


aagatccttt 


11340 


ggattttggt 


11400 


aaataataaa 


11460 


tctaacgctt 


11520 


gacattattt 


11580 


actgatctgc 


11640 


caagtatgac 


11700 


ccttcggcgc 


11760 


catttcgctc 


11820 


gcgcctcaaa 


11880 


ccaaggcaac 


11940 


tggctggctc 


12000 


cgcgcttagc 


12060 


cagcgcggag 


12120 


aagctcgccg 


12180 


tgtgtggctt 


12240 


gttcgagatg 


12300 


tcaccgcttc 


12360 


cgttgctgct 


12420 


cccgaggcat 


12480 


cgttccatg 


12539 



